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Effects of Day-to-Day Load Variation on Trunk 
Group Blocking 


By A. KASHPER, S. M. ROCKLIN, and C. R. SZELAG 
(Manuscript received September 4, 1981) 


Modern trunking theory recognizes the need to account for day-to- 
day load variation when sizing a trunk group for an average blocking 
objective. This paper investigates the effects of high levels of load 
variation on average blocking, the measure of service used for sizing 
final trunk groups in the Public Switched Network. Specifically, we 
identify a curious phenomenon in which high day-to-day variation 
results in low average blocking and characterize the traffic theoretic 
models for which this occurs. By a similar analysis, we also investi- 
gate the behavior of an alternate measure of service, the probability 
of blocking, measured by the ratio of the number of unsuccessful 
attempts to the total number of attempts. 


|. INTRODUCTION 
1.1 Background 


An important class of trunk groups in the Public Switched Network 
consists of those groups that provide the last-choice route for a call 
trying to reach its destination. The performance of such a “final trunk 
group” is defined to be the 20-day average blocking during the chosen 
busy hour of the busy season. In the Bell System, the final groups are 
sized for an objective of one percent average blocking (B.01) in the 
busy season. 

Figure 1 shows the history of the measured busy-hour loads offered 
to a trunk group over consecutive days. The degree of variability of 
the load measurements is not consistent with the hypothesis that the 
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Fig. 1—Busy-season load variations. 


daily busy-hour load is constant; rather, it varies from day-to-day in a 
somewhat random fashion. Modern teletraffic theory has recognized 
that this day-to-day variability in the offered load must be considered 
when determining the number of trunks required for an objective level 
of average blocking. It is generally believed that this variability tends 
to increase a group’s average blocking, and therefore, its trunk require- 
ment, over that observed during constant load conditions. 


1.2 Motivation 


Recent analyses of Bell Operating Company traffic data have re- 
vealed that levels of day-to-day load variation, much higher than those 
considered in current trunk engineering practices, occasionally appear 
in the network. To study the effects of such traffic on network service 
and trunk requirements, the currently deployed traffic models were 
extended into this region of high load variability. The results are 
illustrated in Fig. 2, which shows, for Poisson traffic with a fixed mean 
offered load, a, and number of trunks, c, the average blocking B as a 
function of the variance, v, of the daily offered load. This quantity, 
called the day-to-day load variation, is usually parameterized by the 
variable ¢, with v = 0.13a@° [1, 2]. In current engineering practices, ¢ is 
assumed to vary between 1.0 and 1.84. The graph shows that B 
increases with increasing load variation, as expected, but only up to a 
certain point. Beyond that point, B decreases monotonically to zero. 
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Fig. 2—Average blocking versus day-to-day variation. 


Thus, the model predicts that a call arrival process with a highly 
variable daily load will have lower average blocking than one with a 
constant daily load. 

The purpose of this paper is to analyze and explain this counter- 
intuitive phenomenon and to determine its implications on current 
service objectives and traffic models. 


1.3 Overview 


Section II reviews the model of day-to-day load variation currently 
deployed in Bell System trunk engineering practices. We then define 
the average blocking service criterion used for sizing final trunk groups 
and compare it to an alternate measure of service, the probability of 
blocking. In Section III, we analyze the behavior of the two measures 
of service under conditions of high day-to-day load variability and 
compare numerically their properties in different regions of engineer- 
ing interest. Section IV summarizes our results and discusses the 
implications of our findings on trunk engineering practices and network 
service objectives. 


ll. TRAFFIC MODELS 


This section reviews the model of day-to-day load variation that, in 
conjunction with models describing the within-hour call arrival proc- 
ess, is used to predict the relationship between trunk group size, 
average load, and average blocking. 
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2.1 Average blocking 


The discovery of the effect of day-to-day load variation on average 
blocking is credited to a 1958 study by Wilkinson,® who proposed a 
mathematical model‘ that is now the basis for many of the Bell 
System’s trunk engineering practices. 

In Wilkinson’s model, the load, a, offered to a group of c trunks 
within a time-consistent hour (e.g., 9 am to 10 am) varies from day to 
day in a random fashion. Specifically, the daily loads are modeled as 
independent random variables with a common gamma distribution 
I'\(a|@, v) with mean @ and variance v. We assume further that within 
each hour, the call arrival process is Poisson and that blocked calls do 
not retry. The daily blocking probability for a trunk group with c 
trunks and offered load a is defined by the Erlang blocking function, 
B(c, a); the average daily blocking probability, denoted B, is given by 

B = B(c, G, v) = [ Bic, a)dT(a|4@, v). (1) 


0 


2.2 Other considerations 


Wilkinson’s model (1) has been used within the Bell System to 
account for day-to-day load variation in the sizing of final trunk groups. 
Current trunk engineering practices also account for (i) the non- 
Poisson or “peaked” nature of overflow traffic,’ and (ii) the finiteness 
of the one-hour measurement interval within which traffic measure- 
ments are collected.’ However, the consideration of these additional 
factors affects only the choice of the daily blocking function in (1) but 
not the results of this paper, which hold for any single hour blocking 
function of practical interest. 


2.3 Probability of blocking 


In the next section, we will compare certain properties of the average 
blocking service measure, B, with those of another standard service 
measure, the probability of blocking, defined below. 

Let 


o(c, a) = aB(c, a) (2) 


denote the overflow from a trunk group with c trunks and offered load 
a, and let 


o= | aB(c, a)dT(a|a, v) (3) 
0 


denote the average overflow. We define the probability of blocking, 
Pz, as the ratio of the average overflow to the average offered load: 
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a 
I 


Ql als 


| aB(c, a)dT(a|q@, v). (4) 
0 


Thus, Pg is simply the probability that an arriving call is blocked. 
Comparing eqs. (1) and (4), we note that B is an unweighted average 
of the daily blockings, whereas Pz is a weighted average in which the 
daily blocking is weighted by the daily load.* 
Next, we investigate the behavior of both B and Pz under conditions 
of high-load variability. 


lil, AVERAGE VERSUS PROBABILITY OF BLOCKING 


In this section, we develop analytical results that validate and 
explain the high-variation, low-blocking phenomenon observed in Sec- 
tion I. We begin by relating the asymptotic behavior of B to the 
properties of the assumed daily load distribution. 


3.1 Asymptotic behavior of B 


First, let us generalize the definition of average blocking given in (1) 
to the case in which the daily loads are independent, nonnegative 
random variables with a common distribution function F(a). To sim- 
plify notation, let B(a) denote the daily blocking probability on a 
particular trunk group expressed as a function of its offered load a. 
Then, the unweighted average blocking is the quantity 


s~| B(a)dF(a). (5) 
0 


Clearly, (5) coincides with Wilkinson’s model (1) when B(a) is the 
Erlang B blocking probability and the daily loads are gamma-distrib- 
uted. By defining the average blocking in the more general form, we 
can investigate the role of both the blocking function B(a) and the 
load distribution F(a) in determining the behavior of B. 

Let @ and v denote the mean and variance of the offered ead 
distribution. Our first goal is to find general conditions on F(a) under 
which B — 0 as v/a, the coefficient of variation of F(a), increases. 
Our second goal is to show that, for a very general class of blocking 
functions B(a), the commonly assumed gamma distribution satisfies 
the required conditions on the load distribution. Thus, by analysis we 
will both verify and explain the high-variation, low-blocking behavior 
described in Section I. 

To give a precise answer to the first question posed above, we first 
define the general class of blocking functions to be considered. A real- 
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valued function B(a) will be called a blocking function if it has the 
following four properties: 

(B1) B(0O) =Oand Bia) >0ifa>Q0; 

(B2) Bis continuous; 

(B3) Bis non-decreasing; and 

(B4) Bis bounded. 

Let {F;,} be a sequence of probability distribution functions concen- 
trated on [0, ©) and consider the sequence 


B.- | B(a)dF;,(a). (6) 
0 


In Appendix A, we derive simple, necessary and sufficient conditions 
on the distribution sequence {F;,} under which B; — 0. Specifically, 
this will occur if and only if 


lim F,(a)=1 forall a>O. (7) 


The sufficiency of (7) follows from a standard convergence theorem 
(See Ref. 6, p. 249). A simple, direct proof of both the necessity and 
sufficiency of (7) is given in Appendix A. 

Condition (7) says that all quantiles of the load distribution converge 
to zero. Equivalently, this means that all of the mass of the distribution 
converges toward the origin. Note, however, that (7) does not imply 
that the moments of the distributions (e.g., mean, variance) converge 
to zero. 

Using this result, we can now analyze the effect of high day-to-day 
load variation on average blocking in the case of gamma-distributed 
daily loads. 

Let {IT,(a)} denote a sequence of gamma distribution functions with 
mean a; and variance vz. According to our result, the average blocking 
B, = fo B(a)dT;(a) converges to zero if and only if 


lim T,(a) =1 


for all a > 0. In Appendix B, we show that this occurs when the 
coefficient of variation of I;(a), Vun/ Ga, increases without bound. In 
particular, if @ = @ is fixed and v; —> ~, the average blocking B, — 0 
for arbitrary c > 0. Moreover, the number of trunks required to 
guarantee one percent blocking, B.01, also tends to zero if @, = @ and 
Ur — ©. 

Thus, the results of this section give theoretical explanation for the 
phenomenon observed in Section I. 


3.2 Asymptotic behavior of Pg 


The results of the previous section can also be used to analyze the 
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asymptotic behavior of Pg, the probability of blocking, which we 
defined in Section II. 
Recall that the probability of blocking can be expressed as 


pases | o(a)dF(a), (8) 
a 0 


Qs 


where o(a) is the daily overflow load and F(a) is the distribution of the 
daily offered load. 
Let u(a) = a — o(a) denote the daily carried load. Then 


o=a-— i, (9) 


where WU = fo u(a)dF(a) is the average carried load. Thus, 


P; =——. (10) 





We can now analyze the effect of day-to-day load variation on the 
probability of blocking by studying the behavior of wv. To do this, we 
first note that the function u(a) has the following properties: 

(ul) u(0) =0 and u(a) > 0 fora>Q0; 

(u2) wu(a) is continuous; 

(u3) wu(a) is non-decreasing; and 

(u4) u(a) is bounded (by the number of trunks in the group). 

Thus, the carried load u(a) has the required properties of the 
“blocking function” and our result of Section 3.1 can be applied. That 
is, we know that uw — 0 if and only if the load distributions converge as 
in (7). If Z— 0 and @ is fixed, then 


Pose =, (11) 
a 





In particular, if the daily loads are gamma-distributed, the probabil- 
ity of blocking converges to 1 as the coefficient of day-to-day variation 
increases with the mean held constant. 

These results illustrate dramatically a fundamental difference be- 
tween the average blocking (B) and probability of blocking (Ps) service 
criteria under conditions of highly volatile network loads. For ex- 
tremely high levels of day-to-day load variation, the average blocking 
measure takes on low values, indicating good service, even though 
most of the traffic is blocked (Pg ~ 1). Under such conditions, the 
unweighted average blocking is a poor indicator of the service experi- 
enced by the customer. 


3.3 Comparison of B and Ps 
In addition to the asymptotic results described above, we can also 
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analyze the general relationship between B and Pz. Specifically, we 
will show that Pz; = B regardless of the load distribution. 

To see this, note that our assumption that the blocking B(a) is a 
nondecreasing function of the offered load implies that for any load 
distribution F(a): 


| (a — a)B(a)dF(a) = | (a — a)B(a)dF(a) 
0 0 


a | (a — @)B(a)dF(a) = i (a — a)B(a)dF(a). (12) 


Equivalently, 


00 


| aB(a)dF(a) = <| B(a)dF(a), 
0 


0 


or 


Pz;=-= B. (13) 


Qs 


Next, we complement these results with a few numerical examples 
that illustrate the differences between B and Pz. 


3.4 Numerical examples 


Recall that final trunk groups in the public telephone network are 
sized for an average blocking objective of one percent. The behavior of 
B in this low blocking region is illustrated by the lower curve in Fig. 3, 
which shows, as a function of the day-to-day load variation, the actual 
average blocking experienced by a trunk group sized under the as- 
sumption of no day-to-day variation. As we observed in Section I, B 
initially increases to a maximum value much less than 1.0, and then 
decreases to zero. In contrast, the probability of a call’s being blocked, 
Pz, monotonically increases to 1.0. However, within the range of day- 
to-day variation normally encountered (1.0 < ¢ = 1.84), the numerical 
difference between B and Pz is not great. 

Finally, let us examine the behavior of B on a trunk group sized for 
a high level of average blocking (e.g., 20 percent), assuming no day-to- 
day variation. Fig. 4 illustrates that day-to-day variation exerts an 
altogether different influence in this region. Namely, the average 
blocking decreases monotonically to zero, although the probability of 
a call’s being blocked again increases monotonically to 1.0. 


IV. SUMMARY AND CONCLUSIONS 


This paper describes an investigation of certain properties of the 
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Fig. 3—Blocking versus day-to-day variation. 


average blocking service measure (B) used for sizing final trunk groups 
in the Public Switched Network. The work was motivated by the 
results of a recent data study that suggested that levels of day-to-day 
load variation much higher than those considered by Wilkinson* and 
current Bell System engineering practices occasionally appear in the 
network. The analytical development presented in Section III con- 
firmed our numerical result of Section 1.2 that very high levels of load 
variation will result in low levels of average blocking. In addition, the 
properties of an alternate measure of blocking, Ps, were analyzed and 
compared to those of B. 

Our findings are summarized below: 

(t) For the low, objective level of average blocking (one percent) and 
the traffic conditions normally encountered in the Public Switched 
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Fig. 4—Blocking versus day-to-day variation. 


Network, the discrepancy between B and Pz is not great. That is, the 
current Bell System practice of sizing final trunk groups for a fixed, 
low level of average blocking also yields correspondingly low, though 
not uniform, levels of blocking probability. 

(ii) In contrast with B, engineering for a fixed level of Ps guarantees, 
on average, a fixed fraction of successful calls. However, in the presence 
of high day-to-day variation, the number of trunks required for an 
objective level of Pz is substantially greater than that required for the 
same level of B. Because, the choice of an engineering objective should 
consider both customer satisfaction and cost, we cannot conclude that 
Pz is a better objective than B. 

(zzz) In the range of high blocking, which is typical for private and/or 
special services networks, the discrepancy between B and Pz is sub- 
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stantial. In that case, our results suggest that consideration should be 
given to appropriateness of average blocking as a service measure. 
(iv) We showed that, under volatile traffic conditions, the assumption 
of a particular daily load distribution is crucial in quantifying the 
relationship between load offered to a trunk group and the average 
blocking of the trunk group. 


APPENDIX A 


In Section III, we considered the general class of blocking functions 
B(a) having the following four properties: 

(B1) B(0) =0Oand B(a) > 0 for a> 0; 

(B2) Bis continuous; 

(B3) B is nondecreasing; 

(B4) B is bounded. 
We now prove the following theorem: 


Theorem 1: Let {F;,} be a sequence of probability distribution func- 
tions on [0, ©) and let B(a) be any blocking function. Then 


lim | B(a)dF;(a) = 0 if and only if 
—> CO 0 
lim F(a) = 1 for alla>0. 


ko 


Proof. To show sufficiency, let « > 0. Since B(0) = O and B is 
continuous, a < 6 implies B(a) < «/2 for sufficiently small 5. Assume 
that B is bounded by M. Since 


lim F(a) = 1, 
kw 


for sufficiently large k, F,(5) = 1 — €/2M. For such k, 


foo] 


i) 6 
| B(a)dF;(a) = | B(a)dF;(a) + | B(a)dF,(a) S €/2 + €/2 =e. 


6 


Thus, 
tim | B(a)dF,(a) = 0. 
> 00 0 


To show necessity, suppose that for some ap > 0, 
lim F,(ao) ¥ 1. 
ko 
Then there exists a subsequence {n,} and a A; > 0 such that Fn, (ao) 
= 1— ),. Also, (B1) to (B3) imply that B(a) is bounded away from 
zero on [do, ©), say by Az > 0. Then, 
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i B(a)dF,,(a) = | B(a)dF,,(a) = » | dF,,,(a) = A2-Ai > 0, 
0 a 


0 ao 


which contradicts 
| B(a)dF;(a) — 0; 
0 


thus, 
lim F(a) =1 forall a>0O. 
ko 


QED 


APPENDIX B 


Theorem 2: Let {T(a|a, v)} be a family of gamma distributions 
with given mean a@ and variance v and suppose that a = 1 and 
Vvu/a — o, Then, 


T(a|a,v)—-1 forany a>0. 


Proof: By definition, the gamma distribution with fixed mean a@ and 
variance vu is given by 


T(a|a, v) | t*1e Fdt, (14) 
0 


where I(a) is the gamma function and the parameters a and B are 
determined by 








a = a/B, v = a/f”. (15) 
Integrating (14) by parts we obtain 
= _ B° —Btpaja B°B : —Btzpa 
T(a|a, v) = Tia)a ee |e + Pee. e't*dt. (16) 


Let us start by showing that the first term in (16) tends to 1 as the 
coefficient of variation tends to infinity. From (15) a = @/v and using 
the property of the gamma function we obtain 


T(aja=T(at1)>1 as a=a’/v—0. (17) 
From (15) we have 
—_ —2 _ 
Br={2|7” = exp| m=]. 
v ov 














Using the monotonicity of the exponential and logarithmic functions 
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we have 
2 


- = —2 2 
aa a 

exp |— In —| = exp Se = exp|— In—}. (18) 

v ov v= oU V v 




















Since d@ = 1 the ratio @/v — 0 as @*/v — 0. Thus, passing to the limit 
in (18) and noting that lim z In(x) = 0, we obtain that 


x—+0 
B*—>1 as a’/v->0. (19) 
Thus, from (17) and (19) the first term in (14) tends to 1 as a— 0 and 


B- 0. 

For the second term in (14) we get 
ieee e *t*dt < lim B lim B | a‘*dt =0. (20) 
a0 I'(a)a ; a0 I'(a)a a0 
p-0 p—0 poo 





Thus, if a = 1, for any a> 0, 
T(a|@,v)>1 as Vv/a>o. 


The proof is complete. 
We would like to remark that under the assumption a < 1, Theorem 
2 can be reformulated as follows: 


T(a|a,v)—>1 as v/aow 


for arbitrary a > 0. 
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The Continuing Evolution of the Military 
Standard 105D Sampling System 


By B. S. LIEBESMAN 
(Manuscript received June 18, 1981) 


Military Standard 105D is the most widely used set of acceptance 
sampling plans in the world. This paper reviews the early develop- 
ment of the standard and points out the many contributions made by 
Bell System researchers, such as H. F. Dodge. The paper also reviews 
recent analyses and indicates areas where the special structure 
suggested by Dodge, and adopted in the standard, has been extremely 
valuable. Finally, the paper identifies many questions related to the 
standard that are still open for investigation. 


1. INTRODUCTION 
1.1 Genesis of military standard 105D 


Lot-by-lot acceptance sampling began just prior to World War II 
and was given a large boost during the war because of the need to 
assure the quality of wartime material. Bell Laboratories personnel 
were heavily involved in the early development of sampling plans. The 
most prolific Bell Laboratories contributors were G. D. Edwards, H. F. 
Dodge, and H. G. Romig. 

The initial system of acceptance sampling plans was developed to 
assure wartime material. This system evolved through a number of 
changes to the current system of plans, Military Standard 105D. This 
standard is described in Ref. 1. H. F. Dodge was one of the leading 
contributors to the final development of this system. W. R. Pabst, 
long-time editor of the Standards Section of the Journal of Quality 
Technology, discussed Dodge’s contribution in a paper presented 
_ before the 17th annual convention of the American Society for Quality 
Control (as@c):” 


“Much of the theoretical work underlying the new MIL-STD- 
105D is directly attributed to David Hill and indirectly to Harold 
Dodge.” 
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Relatively few changes have occurred since 1963 when the MIL- 
STD-105D* system was published. What has happened instead has 
been an in-depth investigation of the properties of the system, with 
the result that changes have occurred in the way the system is used. 
Bell Laboratories Quality Assurance Center has been active in this 
investigation. The Center’s effort has been in support of the Western 
Electric Company Purchased Product Inspection organization, which 
uses MIL-STD-105D almost exclusively to inspect products purchased 
by the Bell System. 

Today, MIL-STD-105D is the most widely used system of accept- 
ance sampling plans in the world as shown in a 1970 Japanese study.” 
It forms the basis for the American National Standards Institute 
system, ANSI Z1.4; the Japanese system, JIS 29015; the International 
Standards Organization system ISO 2859; and the British System DEF 
131. Saniga and Shirland* estimated in 1977 that 76 percent of the 
quality control organizations in the United States use the system. The 
Japanese have made extensive use of MIL-STD-105D, a factor which 
may have contributed to the improved quality of Japanese products 
since World War II. Finally, it should be noted that the use of the 
system has spread to many types of items other than manufactured 
goods or raw materials. These include data records, maintenance 
operations, financial records, and administrative procedures. 


ll. DESCRIPTION OF MIL-STD-105D 
2.1 Basic definitions 


A number of terms are introduced in this section. These terms are 
important to the description of the system in Section 2.2 and of the 
issues, past and present, which are discussed in Sections III and IV. 
Definitions have been included in the glossary for handy reference. 

First of all, we define a lot as a set of items under control of the 
inspection organization for which an acceptance or rejection must be 
made. The items forming the lot must be similar in nature. A random 
sample is a subset of the lot which is selected in a manner which makes 
it representative of the lot. In a truly random sample, each unit in the 
lot has the same probability of being included in the sample. 

Inspection by attributes classifies individual samples as either “non- 
defective” or “defective” (good or bad, go or no go, etc.), depending 
upon whether they pass or fail certain tests of characteristics. The 
quality of a product is measured in terms of the percent defective or 
defects per hundred units. Inspection by attributes provides an esti- 
mate of the quality which is used for lot acceptance or rejection. 


* Military Standard 105D is commonly abbreviated using MIL-STD-105D or just 
105D. See the glossary for this and other abbreviations and definitions. 
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The operating characteristics (Oc) is a curve of the probability of 
acceptance as a function of quality for a given sampling plan. Figure 
1 is a typical oc curve. Note that there are two parameters marked on 
the curve, a good quality, AqL, and a poor quality, LTpD. Lots of quality 
equal to the AQuL value have a high probability of acceptance, while 
lots of quality equal to the LTPp value have low probability of accept- 
ance. 

The acceptable quality level (AQL) is the index for the plans of MIL- 
STD-105D. It is defined in Ref. 1 as “the maximum percent defective 
(or the maximum number of defects per hundred units) that, for the 
purposes of sampling inspection, can be considered satisfactory as a 
process average.” It is the AQL value at which the producer generally 
aims the quality of his process. If he produces at this level, he has a 
high probability that most of the lots will be accepted. 
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Fig. 1—Operating characteristic curve. 
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The lot tolerance percent defective (LTPD) is a quality level at which 
most lots will be rejected. Because of sampling error, some lots at this 
quality will be accepted. 

The average outgoing quality (AOQ) is the average quality of all lots 
that are shipped. The average outgoing quality limit (AOQL) is the 
upper bound on AOoQ when using a sampling plan that requires all units 
in all rejected lots be inspected and all defectives removed before 
shipment. The LTPD and AoQL values are not used directly in MIL- 
STD-105D. However, it is worth noting that there are plans based on 
these quantities as indices. The most commonly used set of LTPp and 
AOQL plans was developed by H. F. Dodge and H. G. Romig of Bell 
Laboratories.° 

The final definitions introduced in this section are based on the fact 
that MIL-STD-105D is a system of plans with a feedback mechanism. 
This mechanism consists of four phases of operation and the switching 
rules for transferring between phases. The normal phase is used when 
there is no evidence that the quality being submitted is poorer or 
better than the specified quality level. The tightened phase is used 
when there is evidence of poorer quality, while the reduced phase is 
used when there is evidence of better quality. The discontinue phase 
is entered when the producer has not been successful in improving the 
poor quality of his product during the tightened phase. 


2.2 Procedures of MIL-STD-105D 


An outline of the basic MIL-STD-105D procedures is given here to 
further introduce important terminology and to give the reader an 
understanding of acceptance sampling plans. For more details, refer to 
the military standard,’ and its accompanying handbook,’ or to a quality 
control text, such as that by Duncan’ or Grant and Leavenworth.® 

There are five steps used in the selection of a sampling plan. First of 
all, the lot to be inspected must be formed and the lot size determined. 
The lot should be as homogeneous as possible. Next, the sample size 
code letter must be found in a table as a function of the lot size. The 
third step is to determine the AQL value to use based on the quality 
requirements of the product. The fourth step is to select the sampling 
plan from another table based on the sample size code letter and the 
AQL value. Finally, a random sample is selected, tested, and the lot 
accepted or rejected based on the allowable number of defectives for 
the plan. 

These procedures are used to select an individual sampling plan for 
inspection of an individual lot. In reality, the system of plans operates 
over a sequence of lots using the phases and switching rules to provide 
consumer protection and control over the economy of inspection. 
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Figure 2 illustrates the basic switching rules. Generally, the inspec- 
tion of a product starts in the normal phase (N), where quality is 
assumed to be at the desired level. When two out of five consecutive 
lots are rejected, this is taken to be evidence of poor quality. The next 
lot is inspected under the tightened phase (T). This results in using a 
plan that will reject a higher percentage of lots at a given quality than 
the corresponding plan under the normal phase. 

The acceptance of five lots in a row during the tightened phase is 
evidence that the quality is back to the desired level. When this occurs, 
the next lots are inspected under the normal phase. However, if this 
does not occur before ten lots have been inspected under the tightened 
phase, sampling ceases and the discontinue phase (D) is entered. 

When a product has been inspected under the normal phase for a 
number of consecutive lots, it is eligible for reduced inspection. The 
reduced phase (R) is entered after ten lots in a row are accepted and 
a defect limit number criterion is met. The product remains in the 
reduced phase until either a lot is rejected, or is accepted but the 
number of defectives exceeds a specified number (Ac). 

Thus, we see that MIL-STD-105D is a sampling scheme with a 
feedback mechanism to reward good quality and improve poor quality. 
The manner in which this mechanism operates is not mathematically 
precise, but relies on the nature of human reaction to reward and 
punishment. This was recognized by the developers: 


“Whereas sampling plans are mathematically precise, the study 
of sampling schemes is not limited to pure mathematical consid- 
erations. In fact, many of the decisions related to the development 
of a sampling scheme are more a matter of art, opinion, esthetics, 
appeal, practical considerations and compromise.”” 


Often these words are lost on the practitioners who concentrate on 
the individual sampling plans rather than on the system. The author 
recently attended the 1981 Asqc Quality Congress where he had many 
discussions with users of MIL-STD-105D who were ignoring important 
aspects of the system. In essence, MIL-STD-105D was developed as a 
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Fig. 2—Dynamics of the switching rules. 
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system of plans and any abrogation of its features destroys its effec- 
tiveness. 


lll. DEVELOPMENT OF MIL-STD-105D 


We will now discuss the historical development of MIL-STD-105D. 
It is important to review this development because the current struc- 
ture of the standard is a result of compromises that arose over issues 
relating to the standard. In fact, MIL-STD-105D was created because 
of the need to provide industry with a system of sampled plans indexed 
on AQL values. 


3.1 Need for an AQL system 


Two other types of sampling plans were developed just prior to 
MIL-STD-105D and served as competitors of 105D through much of 
its early history. These are plans indexed by the lot tolerance percent 
defective (LTPD) and the average outgoing quality limit (Aoqu).° 

I. D. Hill discusses the reasons why industry became disenchanted 
with LTPD and AOQL plans and pressured instead for AQL plans of the 
105D type. He states that: 


“In normal situations, the process average corresponds to a high 
point on the OC curve; it has to, if the producer is going to make 
a profit. So it is really a high point rather than a low point which 
is the primary concern of both the producer and consumer.” 


In addition, Hill states that an LTPD plan is not cost efficient: 


“This system [LTPD] leads then in general, to the producer 
making, and the consumer receiving, a quality considerably better 
than is really necessary, and the price must reflect this.” ’° 


To compare MIL-STD-105D to an AoQuL system, we must first 
expand on the definitions of average outgoing quality (AoQ) and 
average outgoing quality limit (AOQL) given in Section 2.1. The Aoq is 
just the average quality of all lots that are shipped. In an AOQL system, 
a rejected lot must be inspected 100 percent and all defectives removed. 
Thus, a rejected lot is made perfect and shipped in sequence with . 
other accepted lots. It can be shown’® that the aoq has an upper 
bound called the aoa. Figure 3 illustrated this. The Ao@L value is 
generally used as the “guaranteed” quality to be provided by a supplier. 

The use of Aogu plans decreased when it was found that 100 percent 
inspection did not guarantee perfection. Hill states that: 


“Now it is well known in practice that 100 percent sorting is 
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AVERAGE OUTGOING QUALITY 





INCOMING QUALITY (q) 


Fig. 3—Average outgoing quality limit (Ao@L). 


unlikely to be properly done ... In stressing, therefore, that the 
AOQL concept requires perfection in the inspection operation, I 
am not claiming that other methods do not require this. It is 
merely that the lack of such perfection seems to matter more in 
the case of AOQL.””° 


He then proceeds to show that because of inspection error, the Aoq 
curve tends to have the shape of the dashed curve in Fig. 4 rather than 
the solid curve shown in Fig. 3. Note that the maximum value of the 
AOQ curve in Fig. 4 depends on r, the probability that an inspector will 
call a bad unit good. The inspection error has an effect on the portion 
of the curve to the left of the dashed curve, but this effect is so small 
that it cannot be shown in Fig. 4. 
The end result of an AoQu plan, according to Hill, is: 


“This Table (Table 1 in Ref. 10) shows that the use of an AOQL 
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Fig. 4—Effect of inspection error on outgoing quality under an Ao@t plan. 
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plan will, in general, force the producer to offer a quality a good 
deal better than the AOQL value, although not to such an extent 
as in the LTPD approach.” 


And, to summarize the reason for taking the AQL approach over the 
LTPD and AOQL approaches, Hill states: 


“Both the LTPD and AOQL concepts are capable of giving good 
protection against poor quality, provided they are used efficiently, 
but both do so at the cost of rejecting a good deal of satisfactory 
production.” ”° 


Thus, AQL sampling plans were developed to be indexed on quality 
values having a high probability of acceptance. This provided the 
producer with good protection against rejection of satisfactory lots. 
Protection for the consumer was provided by adoption of switching 
‘rules which recognize poor quality over a series of lots and take actions 
which put pressure on the supplier to improve. 


3.2 Important issues during the development of MIL-STD-105D 


A number of issues surfaced during the period leading to the 1963 
publication of MIL-STD-105D. Most dealt with the practical applica- 
tion of the theoretical framework of the sampling system. Dodge was 
a prolific contributor during this period because he had close ties to 
both the theoretical academic world and the practical world of Bell 
Laboratories and Western Electric. 

A change to the interpretation of the AQL is a prime example of 
Dodge’s influence. The early tables were designed to have the proba- 
bility of acceptance equal to 0.95 for quality equal to the Aqu value. 
Dodge proposed a special structure which resulted in the probability 
of acceptance for quality equal to the aqu value varying between 0.88 
and 0.99. This structure led to plans which were much easier to use by 
the general quality practitioner, because it consisted of a fixed set of 
sample sizes and a fixed set of AQL values for the entire range of lot 
sizes. 

Dodge suggested that the values of AQL and sample size both follow 
the same geometric progression based on multiples of 710 = 1.585." 
This resulted in the sequence of AQL values currently used: --- , 1, 1.5, 
2.5, 4, 6.5, «+»; and the sequence of sample sizes (n) currently used: 2, 
3, 5, 8, 13, 20, --- . Furthermore, the structure in the table of sampling 
plans was enhanced, because along any diagonal, the product of AQu 
and n is essentially constant and the acceptance number is a constant. 
(See Fig. 5.) Recently, this structure has been quite useful in a number 
of analyses which will be described in Section IV. 
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Fig. 5—Part of a table of sampling plans from MIL-STD-105D. 


The structure devised by Dodge had very strong practical implica- 
tions. The fixed set of AQL values and sample sizes were easy to 
understand, interpret, and use by the general quality practitioner. 

Dodge also provided inputs concerning tightened inspection and the 
switching rules. He suggested” that a switch to tightened inspection 
be done when two out of five consecutive lots fail. This was a change 
from the process average criteria used in early versions of MIL-STD- 
105D and was based on Dodge’s experience with users.” For tightened 
plans, he suggested the use of the same sample size as for normal plans 
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(adopted), so that inspection costs would not increase, and the use of 
an acceptance number one less than the normal acceptance number 
(partially adopted). The tightened plans adopted as part of 105D do 
have AOQL values that are close to AQL values of the corresponding 
normal plans, as Dodge had suggested." 

There were also questions concerning the sample size during the 
reduced phase, the sample sizes during double and multiple sequential 
sampling, and the relationships between lot size and sample size. 
Compromises were reached which resulted in the following: 

(t) The reduced sample size was set at 40 percent of the normal 
sample size: 

(ti) Sequential samples were set to be the same size as the first 
sample; 

(tzi) Empirical relationships for relating lot size to sample size were 
used; and 

(tv) Multiple sampling levels were added to give the user a variety 
of sampling options. 

Finally, there was controversy over analytical tools published to 
help the practitioner analyze his plans. The major tool that eventually 
was added to MIL-STD-105D was a set of charts and tables defining 
the oc curve for each plan. Discussions arose over the distributions to 
be used to calculate the probabilities, and a compromise was made: 

(t) The Poisson distribution would be used for AQL > 10; 
(ti) The Poisson approximation to the binomial distribution would 
be used for AQL = 10 and sample size =50; and 

(tii) The binomial distribution would be used for all other plans. 
Some analysts felt that different measures of plan capability would be 
more useful. For one, Professor Barnard of the Royal Statistical 
Society, in his comments on Hill’s paper,'® suggested the use of the 
average run length (ARL) needed to detect shifts in quality as a better 
measure of plan capability. 

The current version of MIL-STD-105D published in 1963 includes 
other analytical tools. These are tables to determine (Z) the AOQL and 
LTPD values corresponding to each plan, and (zi) charts showing the 
average sample size. 

We are now ready to examine the most recent period in the life of 
MIL-STD-105D. During this period, only minor changes have been 
made to the procedures. This stability has allowed researchers time to 
observe the use of the standard and to analyze the impact of the total 
system. Prior to this time, the research had concentrated on the 
properties of individual plans. 

An interest developed to provide tools to aid users in selecting sets 
of plans from 105D and to analyze the effectiveness of the combination 
of these plans under specified conditions. Some of this work will be 
discussed in the next four sections. 
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IV. ANALYSIS OF MIL-STD-105D 
4.1 System of OC curves 


The development of oc curves for the system of plans represented 
a breakthrough in the use of MIL-STD-105D because it recognized 
the effect of the total system and not just individual plans. This section 
describes the evolution of these curves, which took approximately 
thirteen years, and illustrates the difficulty of changing aspects of 
widely used procedures such as MIL-STD-105D. 

The MIL-STD-105D was developed under the assumption that the 
normal, tightened, and discontinue phases would be strictly adhered 
to.! Use of the reduced phase is considered optional. The need to 
follow all procedures was recognized by Dodge," Hald and Thyregod,”” 
and Stephens and Larson.*® 

Stephens and Larson of Western Electric Company were the first to 
explore the system oc concept. They constructed a Markov model of 
MIL-STD-105D considering two cases: (i) tightened, normal, and 
reduced phases combined, and (iz) tightened and normal phases com- 
bined. The discontinue phase was not included in either case. 

The end result of their model was a composite operating character- 
istic curve for the system: 


Pacq) = rrPar(q) + rnPan(qQ) + rrPar(q), (1) 
where 


Pac = system OC curve 
rr, 'n, rr = expected proportion of lots inspected during 
tightened, normal, and reduced inspection 
Par, Pan, Par = tightened, normal, and reduced oc curves 
gq = quality. 


In addition, they used the Markov model to find the expected number 
of units sampled per lot or the average sample number (ASN): 


ASN =rrnrt+rynyn + rene, (2) 
where 


nr, Nn and nr = sample sizes under tightened, normal, 
and reduced inspection. 


More recently, Schilling and Sheesley’””® used the Markov Model of 
Stephens and Larson to develop tables of system values for the AoqL, 
the limiting quality, the operating characteristics curve, the average 
sample number, the average outgoing quality, and the average total 
inspection. They have suggested incorporation of these curves in future 
revisions of MIL-STD-105D. 

Schilling and Sheesley’’ made a number of observations. Three of 
these will now be discussed. First of all they state that: 
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“Unfortunately, the standard [MIL-STD-105D] is frequently mis- 
used, particularly in nonmilitary applications, through the selec- 
tion and use of normal plans only—disregarding the tightened and 
reduced plans and the switching rules.” *” 


Perhaps the major objective of their papers was to show the value of 
using all of the rules; which results in “enhanced protection for both 
the producer and consumer.” *” 

They also note that the limiting quality, which is synonymous with 
the LTPD value, “is for use with isolated lots and does not reflect the 
limiting quality afforded by the MIL-STD-105D sampling system.” ”’ 

Finally, they state that “reduced inspection provides an obvious 
reward to the producer of a good quality product in terms of lower 
sample size and slightly higher probability of acceptance.” ’’ But, they 
indicate that part of the switching procedures (the use of reduced limit 
numbers) have a minimal effect on the system OC curves. 

Although the work cited in this section represented a breakthrough 
in the use of MIL-STD-105D, Hald and Thyregod,” Stephens and 
Larson,’* and others recognized a shortcoming in their own approaches. 
The main criticism is that they considered a static situation with a 
fixed level of quality as input to the inspection system. As noted by W. 
R. Pabst in the discussion of Ref. 15: 


“What would also be interesting, and perhaps more difficult to 
explore is the dynamic effect of these switching rules on the 
production process.” 


Stephens and Larson agreed and stated that: 


“Hence, the actual behavior of the process under the influence of 
the sampling procedure may thus be very dynamic.”’*® 


These comments influenced some of the work described in the next 
section and led the author to investigate the dynamic effects of the 
switching rules. 


4.2 Controlling average outgoing quality 


The average outgoing quality (aoq) was chosen as the parameter of 
interest in the analysis of the dynamic effects of the switching rules. 
This is because it is a measure of the quality of the product supplied 
to the customer. Hence, Ao@q is the “bottom line” for any sampling 
plan. The investigation in Ref. 19 centered on measurement of the 
effects of switching rule feedback to the supplier and the resultant 
quality provided to the customer. It was reasoned that the switching 
rules were included in 105D to cause the supplier to take actions 
resulting in improved quality when the tightened phase is entered: 
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“When the quality of a product degrades, the tightened inspection 
and discontinue features of MIL-STD-105D can be used to moti- 
vate the producer to improve.” ”” 


This is because fewer lots will be accepted under tightened inspection: 
if quality does not improve. This may be observed in Fig. 6, which is 
a typical comparison of oc curves for a normal and tightened pair of 
plans. The difference between the probabilities of acceptance is shown 
in the curve at the bottom. For example, when quality equals 2.5 
percent, this difference is about 25 percent. Hence, if quality remains 
at 2.5 percent defective during the tightened phase, about 25 percent 
more lots will be rejected than during the normal phase. This should 
act as a strong stimulus to the supplier to improve his quality. In 
addition, the discontinue phase and its consequences will be the end 
results of no improvement during the tightened phase. 

The approach described in Ref. 19 was based on three assumptions 
which differed from those used in Refs. 16 to 18. First, the discontinue 
phase was included in the analysis because the threat of discontinue 
(when enforced) strengthens the effect of the tightened phase on the 
supplier. The premise was that the switching rules were meant to 
provide feedback to the supplier causing him to improve his quality. 
Secondly, it was assumed that a reasonable supplier does respond and 
that the average outgoing quality will be based on the level of improve- 
ment. Thus, two levels of quality were considered: (7) gn, the quality 
during the normal phase, and (iz) gr, the quality during the tightened 
phase. This is in marked contrast to the assumption of a single quality 
level in Refs. 16 to 18. Finally, only accepted lots were included in the 
analysis, whereas the previous work had been based on all inspected 
lots. Note that only accepted lots affect the quality received by the 
customer. 

Under these assumptions, the equation for the average outgoing 
quality (Ao@) for the system of plans is 


qnEan qrE ar 


er ila MO eS 3 
Ean + Ear Ean + Ear (8) 


A0Q = 


where 
qn, Qr = quality during normal, tightened phase 


Ean, Ear = expected number of lots accepted during normal, 
and tightened phases. 


If the level of response (qr) is fixed and qn is varied, the result is a 
curve that is very similar to the aoa curve (Fig. 3). Then, if different 
levels of response are analyzed, we obtain a set of curves similar to 
those shown in Fig. 7. This figure gives an example of the Ao@ for the 
system of 105D plans. The curves are for an AQL of 0.65 percent, a 
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Fig. 6—Effects of tightened inspection. 


sample size of 125, and an Ac of 2. There is one curve for each set of 
values of quality during the tightened phase (qr). The maximum value 
on each curve is called the maximum average outgoing quality (AOQqM) 
value. 

Figure 8 is a sample plot of AoQmM as a function of the level of 
response (qr). The AoQM value is analogous to the AOQL value and if 
qr can be estimated, AoQM represents a limit on the quality provided 
to the customer. The result of this analysis is a demonstration that 
“MIL-STD-105D has feedback properties which tend to limit the 
worst average outgoing quality without relying on the mechanism of 
screening rejected lots.”® 


4.3 Selection of specific plans from MIL-STD-105D 


One of the main results of the AoQM analysis was a more compre- 
hensive look at the properties of the 105D sampling plans. An under- 
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AVERAGE OUTGOING QUALITY IN PERCENT DEFECTIVE 
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Fig. 7—Example of average outgoing quality for the system of MIL-STD-105D plans. 


standing of these properties generated a desire to use them in the 
selection of specific plans from the 105D system. 

The main result in Ref. 19 was a measure of the average outgoing 
quality based on the responsiveness of the supplier. Other measures of 
the plan capability were based on the ability of the 105D system to 
detect changes in quality. These were developed from the standpoint 
of control engineering. Professor Barnard’s comments on Hill’s paper 
point out that some of the early developers recognized the need for 
this type of analysis: 


“One particular thing which would have come out of this [view 
the scheme in terms of control engineering] would have been a 
description of the scheme, not in terms of the OC curve --- but in 
terms of the average amount of production passed after quality 
has deteriorated before the deterioration is picked up by the 
inspection.” ’° 
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Fig. 8—Maximum average outgoing quality (AoqM). 


The results in Ref. 19 led to the development of a preliminary set of 
criteria for evaluating MIL-STD-105D plans. Three quantities were 
considered in Ref. 20. First, the expected number of lots accepted 
during the normal phase prior to switching to tightened phase (En) 
estimates both the speed of detecting a change to poor quality and the 
false alarm jeopardy when quality remains good. Secondly, the ex- 
pected number of lots accepted during the reduced phase prior to 
switching back to the normal phase (Eryn) estimates the same quan- 
tities under reduced inspection. Both Ean and Epn are functions of 
quality during their respective phases. Finally, Aoqm estimates the 
limiting quality leaving the inspection system. The value of AoqmM 
chosen for this estimate assumes that the supplier will respond to a 
tightened quality level, g7, which will have only a small chance of 
causing a switch to the discontinue phase. 

Table I is a summary of the decision parameters developed to select 
sets of 105D plans. The value gg in the table represents a good quality 
level while gg represents a poor quality level. The value of qz is 
typically three times that of gc. 

The computation of the parameters in Table I for the plans of MIL- 
STD-105D is a large task. However, the structure proposed by Dodge” 
and adopted in MIL-STD-105D (see Section 3.2) simplified this prob- 
lem a great deal. 

A set of normalized tables and curves were presented in Ref. 20 that 
reduces the required information by a factor of 14. When quality is 
normalized by the aqu value, a single curve is needed for each accept- 
ance number rather than for each sampling plan. For example, the 152 
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Table I—Decision parameters for selection of 105D plans 


Parameter Phase Purpose 

1. Han(qge) Normal Estimate false alarm rate during the normal phase. 

2. Ean(qs) Normal Estimate response of a plan to a shift to bad quality 
during the normal phase. 

3. AOQM(q?) Tightened Estimate maximum average outgoing quality for a 
supplier’s change to a safe quality q?. 

4. Ern(qa) Reduced Estimate false alarm rate during the reduced phase. 

5. Ern(qa) Reduced Estimate response to a shift to bad quality during the 


reduced phase. 


oc curves under normal inspection can be reduced to the eleven 
normalized curves in Fig. 9. These curves give the probability of 
acceptance as a function of the normalized quality. The units of 
normalized quality are multiples of the AQqL value. Other curves and 
tables are provided in Ref. 20 which may be used to facilitate analysis 
of 105D sampling plans and help in the selection of appropriate AQL 
values. 

4.4 Other results based on the structure of MIL-STD-105D 

Analysts in the Bell Laboratories Quality Assurance Center recently 
investigated three questions which arose during the normal use of 
MIL-STD-105D by Western Electric inspectors. The first of these was 
concerned with the distribution of proportion defective in outgoing 
lots, assuming a distribution of quality in the incoming lots. Brush et 
al.* assume a beta distribution for incoming lot quality, and using the 
set of sampling plans from 105D, they find the mean, variance, and 
equivalent 0.90 beta quantile for the outgoing distribution.”’ Rejected 
lots are assumed to be scrapped. The special structure of 105D leads 
to a small set of normalized curves for the three outputs. These results 
provide the analyst with a powerful tool for determining the effect of 
a single sampling plan on the outgoing quality. 

The second question was also resolved with the aid of the special 
structure of 105D. This is the multiple group situation.” Questions 
arose over the use of 105D when the set of inspection characteristics 
is divided into groups each with its own sampling plan. The MIL-STD- 
105D encourages this situation.’® A conflict may arise because the 
supplier views quality in terms of each individual group, whereas the 
customer views quality in terms of collections of groups called cate- 
gories. The ratio, k, of category AQL to the individual group AQL was 
determined in Ref. 22, assuming each group uses the same sampling 
plan. The special structure of 105D led to the development of tables of 
k as a function of the acceptance number and the number of groups in 
the category. 

The third problem area was that of developing limiting quality or 
LTPD plans which are compatible with 105D plans. Again, the special 
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PROBABILITY OF ACCEPTANCE 





NORMALIZED QUALITY IN MULTIPLES OF ACCEPTABLE QUALITY LEVEL 


Fig. 9—Normalized oc curves. 


structure of 105D aided in the resolution of this problem. The plans 
developed by Duncan, Mundel, Godfrey,* and Partridge* use the same 
lot size-sample size relationship as 105D.” In addition, the sequence of 
limiting quality levels that indexes the plans and the sample sizes 
follow the same geometric progression as 105D. The authors have 


proposed a table of these plans for inclusion in the next revision of 
ISO 2859.4 


V. FUTURE RESEARCH 
5.1 Open questions 


The selection of the “best” sampling plans is certainly an open 
question. The analysis described in Section 3.3 gives the user a meth- 


* Members of Bell Laboratories Quality Assurance Center. 
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odology for selection of an AQL value when costs are unknown. When 
costs are known, or partially known, the problem becomes one of 
including the effects of MIL-STD-105D on total cost. The most recent 
work in this area was given in a paper presented at the 1981 Asqc 
Quality Congress.” In that paper, a cost model and simplified proce- 
dures were developed for selecting AQL values. The analysis should be 
extended to include sample size variation and its effect on total cost. 

A second open question is the amount of switching desirable during 
plan operation. Most of the interest in this question has centered in 
Japan, where researchers have developed a modification to 105D that 
includes different switching rules.’ Little attention has been given to 
this modification in the United States. 

A third open question is a comparison of the procedures of MIL- 
STD-105D with the procedures used by the Bell Laboratories Quality 
Assurance Center to audit Western Electric Company’s manufactured 
product. A new reporting system for the audit, gmp, has recently been 
developed.”© Future work should compare the cost basis and the 
feedback properties of the two systems. 

Finally, the three results discussed in Section 4.4 should each be 
extended. First, the study of outgoing distributions should be extended 
to include the effects of the system of plans. The original study 
encompassed only individual plans. Secondly, the multiple group sit- 
uation should be extended to groups using a mixture of sampling plans. 
And, finally, Lq@u plans should be incorporated in a system similar to 
105D, or these plans should be combined with 105D plans into a 
complete attribute acceptance system. 

Much of the early 105D development work is still open to review 
because of the nature of the system and its basis in compromise. In a 
continually changing environment, the 105D system may have to 
change. Good data are needed to evaluate many of the effects of 105D, 
while good analysts are needed to understand and extend the features 
of 105D. 


GLOSSARY 

Ac Acceptance number; if this number is ex- 
ceeded, either a lot is rejected or a switching 
rule is applied. 

AOQ The average outgoing quality; the average 
quality of all lots shipped. 

AOQL Upper bound of AOQ when all rejected lots 
are inspected 100 percent and all defectives 
are removed. 

AOQM Maximum average outgoing quality. 

AQL Acceptable quality level (good quality). 

ARL Average run length. 

ASN Average sample number. 
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Attribute 


Discontinue 


Kan, Ear, Ern 
IQ 

Lot 

Lot quality (q) 


LTPD 


MIL-STD-105D 
n 
Normal phase 


Operating character- 
istic (OC) curve 


OQ 

Pee 

Par, Pan, Par 
qn, QT 


r 


Yn, fr, Tr 


Random sample 
Reduced phase 


Screening 
Sequential sampling 


Switching rules 


A characteristic of a product which is classified 
as nondefective or defective, good or bad, 
yes or no, etc. 

The phase entered from the tightened phase 
when the supplier does not respond and 
improve his quality. 

Expected number of lots accepted during the 
normal, tightened, and reduced phases. 

Incoming quality. 

A set of items under control of the inspection 
organization for which an acceptance or re- 
jection decision must be made. 

Percent defective or defects per hundred units 
in the lot. 

Lot tolerance percent defective (poor quality 
level). Also called the limit quality level 
(LQL) or limit quality (LQ). 

Military Standard 105D. 

Sample Size. 

Entered when there is no evidence of better or 
poorer quality than desired. 


Probability of acceptance as a function of qual- 
ity. 

Outgoing quality. 

System OC curve. 

Tightened, normal and reduced OC curves. 

Quality level during normal, tightened phase. 

The probability that an inspector will call a 
bad unit good. 

The expected portion of all lots inspected dur- 
ing normal, tightened and reduced phases. 
A truly representative subset of a lot; each unit 
in the lot has the same probability of being 

included in the sample. 

Entered when there is evidence of good qual- 
ity. 

100 percent inspection of a rejected lot. 

A sequence of samples are used during which 
the decisions are to accept the lot, reject the 
lot or take the next sample; the average total 
sample size is smaller than for comparable 
single sampling plans. 

Rules for switching between phases. 
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Tightened phase Entered when there is evidence of poor quality. 
105D Military Standard 105D. 
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Expansions for Nonlinear Systems* 


By I. W. SANDBERG 
(Manuscript received April 22, 1981) 


In this paper we study operator-type models of dynamic nonlinear 
physical systems, such as communication channels and control sys- 
tems. Attention is focused on the problem of determining conditions 
under which there exists a power-series-like expansion, or a polyno- 
mial-type approximation, for a system’s outputs in terms of its inputs. 
Related problems concerning properties of the expansions are also 
considered and nonlocal, as well as local, results are given. In 
particular, we show for the first time the existence of a locally 
convergent Volterra-series representation for the input-output rela- 
tion of an important large class of nonlinear systems containing an 
arbitrary finite number of nonlinear elements. 


1. INTRODUCTION 


In this paper we study operator-type models of dynamic nonlinear 
physical systems, such as communication channels and control sys- 
tems. Attention is focused on the problem of determining conditions 
under which there exists a power-series-like expansion, or a polyno- 
mial-type approximation, for a system’s outputs in terms of its inputs. 
Related problems concerning properties of the expansions are also 
considered and nonlocal, as well as local, results are presented. In 
particular, we show for the first time the existence of a locally conver- 
gent Volterra-series representation for the input-output relation of an 
important large class of nonlinear systems containing an arbitrary 
finite number of nonlinear elements. 

With regard to background material, functional power series of the 
form 


* The material given in this paper was described in the writer’s “Conference Course” 
at the 1981 European Conference on Circuit Theory and Design, The Hague, August 
1981 
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Ro + » ic f Rnlft, T1, °° *,Tm)U(Ti) - ° U(Tm)aT1 eee dtm, (0) 


in which Xp is a constant, t is a parameter, and u and the k,, for m = 1 
are continuous functions, were considered in 1887 by Vito Volterra!” 
in connection with his studies of functions of functions. (These studies 
provided much of the initial motivation to develop the field now known 
as functional analysis.) About twenty years later, Fréchet? proved that 
a continuous real functional (i.e., a continuous real scalar-valued map) 
defined on a compact set of real continuous functions on [a, b] could 
be approximated by a sum of a finite number of terms in Volterra’s 
series (0), but with (in analogy with the well-known Weierstrass 
approximation theorem) the number of terms, as well as the Rm, 
dependent on the degree of approximation. 

It was Norbert Wiener* who first used a Volterra-series representa- 
tion in the analysis of a nonlinear system.* The form of Volterra’s 
expansion provided also the basis for Wiener’s later work (see, for 
example, Refs. 5 and 6) on nonlinear analysis and synthesis. His 
studies, which were concerned mainly with the modeling of systems 
when only input-output data (rather than the system’s equations) are 
available, stimulated considerable interest concerning Volterra and 
other’ functional expansions for nonlinear systems. It was appreciated 
from the outset that such expansions, when they exist, could provide 
important insight of a qualitative nature concerning the input-output 
behavior of a system, and that they could be useful in connection with, 
for example, the estimation and/or equalization of distortion caused 
by nonlinearities. 

There is a fairly large literature related directly or indirectly to the 
material of the present paper (see, for example, Refs. 1 through 29 and 
the references cited there). In most cases the functional expressions 
considered are Volterra series (or truncated Volterra series). With 
regard to systems for which the governing equations are known, with 
relatively few exceptions, questions concerning the existence of an 
expansion, its convergence, and/or the nature of the approximation 
provided by a truncated series are either not addressed or are left 
unanswered. (See, for instance, the remarks in Ref. 6, p. 137, on the 
lack of understanding concerning convergence.) 

On the other hand, some material regarding the range of validity 
and specific properties of functional expansions has appeared, both for 
systems governed by ordinary differential equations defined on a finite 
time interval [0, 7] (Refs. 15, 16, 20, and 23 are representative refer- 


*In Ref. 4 Wiener considers the problem of evaluating the output moments of a 
specific type of detector circuit driven by a random input. 
t See, for instance, Ref. 7. 
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ences), and for polynomic systems, which are modeled by operator 
generalizations of ordinary functions of polynomial form.”~” The work 
on differential equations is concerned mainly with the particular case 
of bilinear systems and with “linear-analytic” systems.'®”? The main 
result obtained asserts that under certain conditions (see, for example, 
Ref. 23) there does exist a locally convergent Volterra series for the 
solution (but with the size of the region of convergence dependent 
on 7). 

The studies of polynomic systems, which are operator theoretic in 
nature and which draw on the theory of multilinear forms, are more 
closely related to the results reported in this paper. The most pertinent 
earlier proposition” is one to the effect that if a certain contraction 
mapping condition is met, then it is possible to construct in a particular 
way a local inverse of a certain generalized power series. While we do 
not use previous results in the polynomic systems area, there are some 
points of contact with the earlier material, and this is discussed at 
appropriate places in Section II. 

We now briefly outline the remainder of the paper. Section II begins 
with some mathematical preliminaries. In Section 2.1, we introduce 
the general setting of concern throughout the rest of Section II. This 
involves two maps f and g related by /[ g(u)] = u for u in a certain set 
that can be thought of as a set of system inputs such that each u 
contained produces an output g(u). The remaining portion of Section 
II describes, proves, and discusses results concerning expansions and 
approximations of g(u). The material in Section II is somewhat ab- 
stract. Examples which illustrate how the material can be used to 
obtain more specific results of general interest are given in Section III. 


Il. APPROXIMATIONS AND EXPANSIONS 


Throughout the paper, # and #4 denote two Banach spaces, each 
with real or complex scalars, and X denotes a nonempty open subset 
of Zo. We use the symbol || - || for the norm associated with &, as well 
as for the norm associated with 4, and @ is used to denote the zero 
element of # and of A. 

It will become clear shortly that a central role in our development 
is played by first and higher order Fréchet derivatives (see, for instance, 
Ref. 30). Before proceeding, we recall a few pertinent facts and defi- 
nitions. 

Let F map X into &, and let xo be a point in X. If there is a bounded 
linear map L,, from 4 to Z such that || F' (xo + h) — F(x) —Lz,h|| = 
o(||A||) as ||A|] — 0, then F is said to be Fréchet differentiable at xo 
with Fréchet derivative L,,, which we denote by dF'(xo). If F'is Fréchet 
differentiable at every point in X, then we say that F is differentiable 
on X. Similarly, if F is Fréchet differentiable on X and dF‘(-) is 
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continuous, then F' is said to be continuously differentiable on X. 
Higher order Fréchet derivatives of F are defined in the usual inductive 
manner. 

Note that the second-order Fréchet derivative d°F' (xo), when it 
exists, is a bounded linear map from &p into the space L( #, &) of 
bounded linear maps from & into &. For h; and hz in Bo, d?F (xo) hihe, 
by which we mean [d?F (xo) fi Jhz, is an element of &, and, therefore, 
d°?F (xo) can be regarded as a bilinear map from 4 x Zp into F, i.e., it 
can be identified in the obvious way with such a map. This bilinear 
map satisfies the symmetry condition that [d?F (xo) AiJhe = [d?F (xo) he] 
hy. In general,®° the mth order Fréchet derivative d”F'(xo) for m > 1 is 
a bounded linear map from & into a Banach space of bounded linear 
maps with ||d”F(xo) || defined in the usual way in terms of induced 
norms, and dF (xo) can be regarded alternatively as a symmetric m- 
linear map from 4% into #. Moreover, for 1 = 1 < m and hy, he, ---, 
h; elements of 4, d"F (xo)hihz --+ hi is a bounded linear map from 
Zp into a Banach space of bounded linear operators. 

A result that we shall use is the following essentially standard 

inverse function proposition (Ref. 30, p. 273).* 
Lemma 1: Let F:X — & be continuously Fréchet differentiable on X, 
and let xo € X. If dF (xo) is an invertible map of AZ onto Z, there is an 
open'neighborhood V C X of xX such that F restricted to V is a 
homeomorphism of V onto an open neighborhood of F'(xo) in &. In 
addition, if F is r times continuously differentiable on V, the inverse 
mapping G of F(V) onto V is r times continuously differentiable on 
F(V). 

Also of importance in our work are derivatives of Banach-space- 
valued maps defined on an open subset S of the real or complex 
numbers. If Ff maps S into # and so € S, then F is said to have a 
derivative dF (so) /ds at so if dF (so)/ds is an element of # and we have 


i ly"LF (so + y) — F(so)] — dF (so)/ds|| = 0. 


Again, higher order derivatives are defined in the usual inductive way. 
Here derivatives are elements of &. 


2.1 fandg 

Throughout the paper, /:X — # denotes a map with the property 
that there is a nonempty open convex subset U of # such that for each 
u & U, there is in X a unique x, such that f(x.) = u. (Recall that X is 
a nonempty open subset of #o). 


* With regard to a difference in a hypothesis of Lemma 1 and the cited proposition 
in Ref. 30, we note that if A is a bounded linear invertible map of # onto # with inverse 
A7', then, by a result of Banach, this inverse is a bounded linear map of # onto Ao. 
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We shall be concerned primarily with the map g:U — X defined by 
flg(u)] = u for every u € U. 


2.2 A representation theorem for g(«) 


We shall refer to the following two hypotheses: 
H.1: For some positive integer p, the mth order F-derivative 
(i.e., Fréchet derivative) d”f exists and is continuous on X for m = 
1, 2, One fey (p + 1). 
H.2: [{df(x)]” exists [i.e., df(x) is an invertible map of # onto 4] for 
xEX. 
Theorem 1: Suppose that H.1 and H.2 are met, and let u and uo 
be points in U. For m = 1, 2, ---, p + 1 and each B € (0, 1], let 
&m(Uo, U — Uo, B) be defined as follows: 


1.(a) gi(uo, uU— Uo, B) = df{ g[uo + B(u — uo) ]}7(u — wo) 
1.(b) 2m(Uo, u — Uo, B) = 


—df{ g[uo + B(u — uo) ]}™ % ay" Rythyte  +hp= 


kj>0 


- d'f{ g[uo + B(u — uo) ]} gx,(Uo, U — Uo, B) gx,(Uo, U — Uo, B) 


+++ Bp(Uo, U— Uo, B)* (1) 
form =2,-+-,p+1. 
Then 8p+i(uo, U — Uo, 8) depends continuously on B for B & [0, 1], 
and we have 


&(u) = g(uo) + Bi(Uo, U — Uo) + +++ + Zp(Uo, U — Uo) 
+ (p+1) (1 — B)?gpii(Uo, U — Uo, B)AB, 
0 


in which gn(Uo, U — Uo) = Bn(Uo, U — Uo, 0) form = 1, ---, p. 
Moreover, 
2.(a) there are positive constants p and o, which do not depend on 
u, such that 








g(u) — g(Uo) — x £m(Uo, U — Uo) 








Spllu-wl?" for = |ju- wl =o, 


2.(b) there are positive constants pi, p2, +++, pp, Which do not 
depend on u, such that || 8m(Uuo, U — Uo) || S pm||u — uoll” for m = 
1, 2, se, D, and 


*In (1), > 


Rkythot---+ky=m 


k,>0 


denotes a sum over all positive ki, --- , k; that add to m. 
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2.(c) form = 1, 2, +--+, pwe have 8m(Uo, Ua — Uo) = 1" Bm(Uo, Ub — Uo) 
for Ua and uy in U such that (ua — Uo) = r(us — Uo) for some real 
number r. 

Proof: By H.1, H.2, and Lemma 1, the mth order F-derivative d”g (w) 
exists and is continuous for m = 1, 2, ---,(p+1) andw€E U. 

By a version of Taylor’s theorem for Fréchet differentiable maps 

(Ref. 30, pp. 190-1), 


g(u) = g(uo) + dg(uo)(u — uo) + 5 d’g(uo)(u — uo)? 


1 
het a es ~ d?g(Uo)(u — Uo)? 
p! 


lia _ pyp 
" | (1 —# dM glup + B(u — Uo) ](u — uo)? dB, (2) 
E ! 


where (uw — Uo)”, for m = 1, 2, ---,p +1, denotes [(u — wo), (u — Uo), 
«++, (WU — Uo)|] with mterms. | 

Since d”g(w) exists for m = 1, 2, ---, p + 1 for w belonging to 
the convex set U, d”g[uo + B(u — uo)]/dB” exists and is equal to 
d™g[uo + B(u — uo) |(u — Uo)” for B E [0, 1] and m = 1, 2,---,p+1 
[31, p. 198].* 

Let qg(f) denote g[uo + B(u — uo)] for all real B such that uo + 
B(u — uo) € U, and let g() stand for the mth derivative of q(B) 
with respect to 6 at the arbitrary point B € [0, 1]. 

We have /[q(8)] = uo + B(u — uo) when uw + B(u — uo) € U. Bya 
version of the chain rule (Ref. 31, p. 173) for the derivative of a 
composition function, | 


df[q(B)lq°(B) =u-wu, BE [0,1] (3) 
since df[q(8)] and g"(f) exist for B & [0, 1]. Thus, 
dg[uo + B(u — uo)(u — uo) = g(B) 
= df[g(uo + B(u—uo))}"(u— uo), BEl0,1] (4) 
Now let 2=m << (p + 1), and let f:A.— Band qo:(—~, ~) > B be 
defined by 


fol) = S, @)a'fa(BNILy - a8) (5) 


* More specifically, since with h = (u — uo), g[uo + (B + o)h] = glu + Bh) + 
dg(uo + Bh)ch + o(||ch||) as o— 0 (for B € [0, 1)), it is clear that dg(uo + Bh)/dB exists 
and is equal to dg(u + Bh)h for B € [0, 1]. Similarly, using d”“'g[uo + (B + o)h] = 
d”™""g(uy + Bh) + d”g(uo + Bh)oh + o(|\ch||) as o> 0 for 2 <= mS (p + 1), we see that 
d”g(uo + Bh)/dB” = d"g(uo + Bh)h” for 1 sms (p+ 1). 
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go(r) = q(B) + Pp» (1!) "(7 — B)'q(B) (6) 


for y € A and r € (—%, ~), where B € [0, 1]. We will use the following 
generalization of the classical rule for differentiating a product of two 
differentiable functions. 
Proposition 1: Vith S a Banach space and A an open interval in 
(—00, ©) containing a point ro, let L(r) denote a bounded linear map 
from & into S for each r € A, and let e(*) be a map from A to &. If 
dL (ro) /dr and de(ro)/dr exist, then d[L(r)e(r) |/dr exists at r= ro, and 
d[L(r)e(r)|/dr = L(r)de(r)/dr + [dL (r)/drje(r) at r = ro. 

A proof of Proposition 1 is given in Appendix A. Using Proposition 
1 and the observation that d”f[q(8)|/dB” = @ for B € [0, 1] when 2 = 
m = (p + 1), we show in Appendix B that d”{ folgo(r)]}/dr” | =~. = 8 
for B & [0, 1]. 

Since we have 


m m 1 
fol go(r)] = py (1!)"d f[q(B)] (3 (RI) "(r — pyta)) 


for every r, and each d'f[q(8)] can be regarded as a /-linear operator 
on &, it follows that for B € [0, 1]: 


0 = d™ fol go(r)]/dr™ | =p 
=m! > (2!)~* » (Rilke! +++ Ri!) 
l=1 


ky+kot+- . ++k,=m 
kj>0 


-d'fla(By a (B)q"(B) «++ (8). (2) 


Referring now to the g,,(Wo, uU — Uo, 8) in the statement of the 
theorem, notice that gi(uo, u — uo, 8) = q(R) for B € [0, 1], and that, 
by (7), 8m(Wo, U — Uo, B) = (m!)~'q'”(B) for 2 < m S (p + 1) and 
B € [0, 1]. Therefore, using (2) and g°” (8) = d”g[uo + B (u — uo) ]- 
(u — Uo)” for B € [0, 1] and each m, we obtain the formula for g(u) 
given in the theorem, in which g,+1(WUo, U — Uo, 8), which is equal to 


(p+ IP td? glu + B(u — uo)](u — uo)?*”, 


depends continuously on £ for 0 = 6 = 1. 
Since 


m 


Em(Uo, U — Uo) = (m!)~*d™g(uo)(u — Uo) 


for m = 1, 2, ---, p in which d”g(uo) is an mth order Fréchet 
derivative,* it follows at once that properties 2.(b) and 2.(c) hold. 


*In particular, d"g(u) and dg(uo)U1 «++ Un-1forlslsm-—landujeé #forls 
J =m -— lare bounded linear maps on &%. 
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Finally, and referring to (2), since 
| a4 (0)(t — wo) | Id") |] — wo"? 


property 2.(a) is a consequence of the result (Ref. 30, pp. 190-1) that 
for every oo > 0 there is a 0 > 0 such that 


for ||u — uol| So. 


g(u) — dg(uo)(u — uo) — 5 d’g(uo)(u — uo)? 





1 
_— apt” emu — Uo) ‘P*») 


S ao||u — wol'?*” 








2.3 Corollary to Theorem 1 


Corollary 1: Suppose that H.1 is met, and that uo € U is such that 
df[ g(uo)] ts invertible (i.e., is an invertible map of BZ onto &). 
Then there are positive constants p and o such that for u € U with 
||u — woll So, 


l|g(u) — g(uo) — gi(uo, u — uo) — +++ — Bp(Uo, U — Uo)]|| 
= plu — woll?*”, 


in which gi(uo, U — Uo), «++ , Zp(Uo, U — Uo) are defined in Theorem 1. 
In addition 2.(b) and 2.(c) of Theorem 1 hold. 

Proof: Since df(-) is continuous on X, and, by a theorem of Banach,” 
df{g(uo)]” is bounded, by a standard type of argument (see, for 
example, Ref. 30, pp. 154-5) df(-)~' exists and is continuous* in some 
open neighborhood I of g(uvo) in X. Also, since df[ g(uo)]* exists, 
contains an open neighborhood Ng.) and U contains an open neigh- 
borhood N,, of uo such that f restricted to Ng) is a homeomorphism 
of Ng.) onto N,, (see Lemma 1). Let = be a nonempty open convex 
subset of N,,,. At this point the corollary follows from Theorem 1 with 
X = Ngu,) and U = =. 


2.4 Comments 


For a fixed uo, the expansion given in Theorem 1 has the properties 
that the homogeneity condition 2.(c) is met and the remainder, the 
integral, is bounded above by p|| u — wo||'”*” for || uw — uol| < o for some 
positive constants p and o. A proof given in Ref. 33, p. 174 can easily 
be modified to show that the expansion is unique in the sense that 
there is no other similar [i.e., g(uo) plus p terms plus remainder] 
expansion of g(u) valid for all u € U with these homogeneity and 


* The continuity is not used in the present proof. It is used in Section 2.7 and in 
Appendix C, where reference is made to this proof. 
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remainder properties. Of course, a corresponding uniqueness proposi- 
tion holds in the case of the truncated expansion in Corollary 1. 

In some cases, d’f[ g(uo)] of Theorem 1 is the zero map whenever / 
is even and 2 </<p. Then g,(uo, u — Uo) = 8 for m even with 2 = 
m =p. This follows from a simple inductive argument using 1.(b) and 
the observation that ki + ko + --- + kis an odd number if / is odd and 
each k; is positive and odd. 

Referring to Corollary 1, we can establish the existence of an 
expansion in a more general setting. Specifically, let A, be a third 
Banach space and let h(-, -) be a (p + 1)-times Fréchet continuously 
differentiable map of So X S into 4, where So and S are nonempty 
open subsets of # and &%, respectively. Let xo and up be elements of So 
and S, respectively, such that (xo, uo) = 0, in which here @ is used to 
denote the zero element of #%,. Finally, assume that D,h(xo, uo) the 
Fréchet partial derivative of h(x, u) with respect to x, at the point 
(xo, Uo), is an invertible map of # onto F,. 

By the implicit function theorem in Ref. 30, p. 270 and a related 
proposition in Ref. 30, Result (10.2.3), it follows that there is an open 
convex neighborhood N of wo in S, and a (p + 1)-times Fréchet 
continuously differentiable map w of N into Sp such that w(uo) = xo 
and A[w(u), u] = 6 for u € N.* Therefore (2), with g replaced with w, 
is a representation about uo of w valid for u € N (Ref. 30, pp. 190-1). 
It can be shown that the terms in the representation can be determined 
by successively differentiating h{w[uo + B(u — Uo)], Uo + B(u — Uo)} 
with respect to B and setting the result equal to @. [Recall that 
D,h(xo, Uo) is assumed to be invertible, and see the proof of Theorem 
1.]' 

The proof of Theorem 1 shows that the recursive relation (1) arises 
in a natural way. Such formulas concerning the inversion of ordinary 
power series and/or the derivatives of composite ordinary functions 
are probably well known in some circles. In Ref. 29, similar relations 
are given in an abstract setting for the different problem of constructing 
a local inverse of a mapping that has a power series expansion. 

The expansion of g(u) in Theorem 1, and its associated trunca- 
tion in Corollary 1, each contains a constant term g(uo), a term 
&i(Uo, U — Uo) that can be written as L,,(u — uo) in which L,, is a 
bounded linear map, and a sum R,,,(u — uo) of higher order terms such 


* Also, N can be chosen so that w is the only continuous map of N into Sy such that 
w(Uo) = Xo and A[w(u), u] = 6 foru Ee N. 

t Similar remarks apply also in the case of Theorem 4, below. By applying either 
Theorem 2 or Theorem 5, below, to the map H:S» X S— B, X B defined by H(x, v) = 
[A(x, v), v] for (x, v) € So X S, the writer has obtained an explicit expansion, involving 
partial derivatives of h(-, -), for the solution x of h(x, u) = w in terms of u and w, under 
certain reasonable assumptions concerning A(-, -) and the sets from which u and w are 
drawn. The details will be given in another paper. 
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that R,,(A) = o(||A|]) as ||2|| 0. The following result shows that the 
hypothesis of Theorem 1 that H.2 is met, and of Corollary 1 that 
df [ g(uo)] is invertible, are not merely ones of convenience which allow 
an explicit expression to be given for the terms. 

Proposition 2: Let uo be an element of U such that df[ g(uo)] exists 
[respectively, such that f is continuously F-differentiable on a neigh- 
borhood of g(Uo) |. Suppose that there is a constant o > 0 such that 


&(u) = g(uo) + L(u — uo) + R(u — uo) (8) 


for u € U with || u — uol| < 6, in which L is a bounded linear map from 
B into ZB, and R(-) is a map of & into B with the property that 
R(h) = o(|\|A||) as ||h|| ~ 0. Then df[ g(uo)]" exists [respectively, 
df(x)~' exists and is continuous in x for x in some neighborhood of 
&(uo)]. 

The proposition is proved in Appendix C. It shows, for example, that 
H.2 is a consequence of the hypotheses that g(U) is an open set, X = 
g(U), fis differentiable on X, and (8) holds for each wo € U. In this 
connection, notice that because U is open, g(U) is open under merely 
the condition that f is continuous on X. 


2.5 Results for complex Banach spaces 


Theorem 2: Suppose that Z and & are over the complex field, that 
the F-derivative d™f(x) exists at each x € X for all m, and that H.2 is 
met. Let Up € U, and let p be a positive constant with the property 
that u + vu & U for |\v|| < p. Then for u € U such that ||u — uoll < p, 
we have 


g(u) = g(uo) + a &m(Uo, U — Uo), (9) 
in which 
£i(Uo, U — Uo) = df g(uo)]-*(u — uo), (10) 
and 


Em(Uo, U— Uo) = —df[ g(uo) PY (1!) ie y F d'f[ g(uo) lx, 
=2 yt gees + yam 


+ (Uo, U — Uo) Er,(Uo, U — Uo) +++ Bx(Uo,U— Uo), me2. (11) 


Proof: Here d”g exists on U for each m (see the proof of Theorem 1). 
In particular, dg exists throughout U, and therefore we see that 


lim z7'[ g(x + zh) — g(x)] 
z—0 
exists in Z (and equals dg(x)h) for each x € U andh € &, where z is 
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a complex scalar variable. Thus, by Theorem 3.16.2 of Ref. 34, p. 111 
and the development preceding it, 


a™g[uo + z(u — Uo) |/dz™|2=0 


exists for each m = 1, and we have 
g(u) = g(uo) + Y (m!)d™g[uo + 2(u — uo)]/dz”|z=0 
m=1 


for ||u — uol| < p. 
Notice that 


d”g[uo + z(u — Uo) ]/dz” |z-0 = d"g[Uo + r(u — Uo) ]/dr™|-=0 
in which r is a real variable. Since (see the proof of Theorem 1) 
da” g[uo + r(u — uo) ]/dr™|,-0 = m!8m(Uo, U — Uo), 


the proof is complete. 
Theorem 3: Let the hypotheses of Theorem 2 hold. Then for each uo 
there is ao > 0 such that the series on the right side of (9) converges 
uniformly for || u — wo|| < o. 
Proof: The proof of Theorem 2 shows, using the openness of U, 
that for each up there is a p > 0 such that the series converges for 
|| 7 — wol| < p. Since dg exists on U, g is continuous on U. The map g 
is, therefore, locally bounded on U in the sense of Ref. 34, Definition 
3.17.1, and thus the proof of Theorem 3.17.1 of Ref. 34, p. 112, shows 
that, given wo, there is a o > 0 such that the convergence is uniform for 
|| — Uo|| <o. 

The following result is obtained from Theorems 2 and 3 in the same 
way that Corollary 1 is proved. 
Theorem 4: Assume that 2 and & are over the complex field, and 
that df exists on X for each m. Let uo € U, and suppose that 
df[ g(uo)] is an invertible map of ZB onto Z. Then there isaa>0 
such that the expansion 


g(u) = g(uo) + 2 &m(Uo, U — Uo) 


is valid and uniformly convergent for u € U with ||u — uo|| < o, where 
&i(Uo, U — Uo), 2(Uo, U — Uo), +++ are defined by (10) and (11). 


2.6 Comments 


Under the conditions of Theorem 2 (respectively, Theorem 4) the 
infinite sum R(u — uo)of the terms go(uo, u — Uo), £3(Uo, U — Uo), °° 
has the property that R(h) = o(||All) as ||A|| — 0. (This follows from 
the fact that dg(uo) = df[ g(uo)]"'.) Therefore, Proposition 2, as well 
as remarks similar to those of Section 2.4, apply here too with regard 
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to the necessity of the hypothesis that H.2 is met (respectively, 
df g(uo)] is invertible). 

Following is an interesting corollary of Theorems 2 and 3. 
Corollary 2: Suppose that Z and & are complex Banach spaces, and 
that f is a C”-diffeomorphism of X onto U (i.e., that f is a homeo- 
morphism of X onto U such that f and its inverse g have F-derivatives 
of all orders on X and U, respectively). Let uo € U, and let p be 
a positive constant with the property that up + v € U for ||v|| < p. 
Then the series representation (9), in which the gm(uo, U — Uo) are 
given by (10) and (11), is valid for ||u — uo|| < p, and there isaa>0 
such that the series on the right side of (9) converges uniformly for 
|| — uol] <o. 

Proof: Since (df)~™ exists on X under the conditions of Corollary 2*, 
Corollary 2 follows from Theorems 2 and 3. 


2.7 Discussion 


Uniqueness propositions similar to the one described in Section 2.4 
apply in the cases of Theorems 2 and 4, as well as Corollary 2. Consider, 
for example, Theorem 2 and assume that its hypotheses are met. From 
(10) and (11) (or from the proofs of Theorems 1 and 2), we see that the 
expansion on the right side of (9) has the homogeneity property that 
for each m, 8m(Uo, Ua — Uo) = 1"Bm(Uo, Us — Uo) for Ua, Uy € U such that 
|| a — Uol| < p, || we — wol] < p, and (ua — Uo) = r(Uy — Uo) for some real 
r. Suppose that g(uo) + Yy=1 Am(Uo, U — Uo) is also an expansion of 
g(u) about up valid for ||u — wo|| < p, and that it has the corresponding 
homeogeneity property. Assuming, for the purpose of induction that 
hn(Uo, U — Uo) = Ym(Uo, U — Uo) for ||u — uo|| < p and1 =m <n for 
some nonnegative integer n, we see’ that for any fixed u such that 
||u — woll <p, 


An+i(uo, u — Uo) — 8n+1(Uo, u — Uo) 
lee} 


= YY [&m(uo, u — uo) — Am(uo, u — Uo) Jr" "Y= (12) 
2 


m=n+ 


for 0 < |r| < 1. Since 


sup|| 8m(Uo, U — Uo) — Am(Uo, U — Uo) || 


* We have /[ g(u)] = u and g[ f(x)] = x for each u € U and each x € X. Thus, by a 
version of the chain rule for differentiating a composite function (Ref. 31, pp. 171-2), 
df{ g(u)]dg(u) = I and dg[ f(x)]df(x) = Ip for each u and x, where J and Jp are the 
identity maps on # and Ap, respectively. This shows that df(x) has both a right inverse 
and a left inverse for each x € X, and, therefore, that (df)~* exists on X. 

* This type of observation is used in Ref. 33, p. 174, to prove the uniqueness result 
given there. 
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is finite, the right side of (12) approaches zero as r —> 0. Thus An+i- 
(Uo, U — Uo) = Bn+i(Uo, U — Uo), and, therefore, hn(Uo, U — Uo) = Zn(Uo, 
u — Uo) for all m and all wu such that ||u — wol| < p. 

The proof of Theorem 2 is based in part on basically well-known 
results concerning Banach space valued functions of a complex vari- 
able. Such results are used also in, for example, Refs. 20 and 24 for 
related but different purposes. 

A comparison of Theorems 1 and 4 leads us to ask whether Theorem 
1 can be used to prove a result along the lines of Theorem 4 for cases 
in which # and & are not necessarily over the complex field, but the 
\|d’f(x) || are sufficiently small in some not too restrictive sense for x 
near g(uUo). In this connection, we have the following. 


Theorem 5: Let d”f exist on X for each m, and let up be a point in U. 
Suppose that there are positive constants 6 and y, and a neighborhood 
No of g(uo) in X, such that ||a”f(x) || = m!dy” for x € No and every 
m = 2. Assume that df[ g(uo)| is an invertible map from & onto &. 
Then the conclusion of Theorem 4 holds. 

Proof: By the proof of Corollary 1, it suffices to show that there is a 
o > 0 such that 


(p + 1) | (1 a B)?Bp+1(Uo, u— Uo, B)dB, 
0 


the remainder in the expansion for g(u) of Theorem 1, approaches @ as 
p— © uniformly for || uw — uo|| < o. Since fo (1 — B)?dB = (p + 1)"’, it 
is enough to prove that there are constants o > 0, d >0, and c > 0 such 
that for all p, all 6 € [0, 1], and all uw € U with ||u — ull < o, we have 
||Zp(Uo, U — Uo, B)|| S ce~”. That we do as follows. 

By the continuity of g at uo, and the continuity of (df)~’ at g(uo) 
(see the first part of the proof of Corollary 1), choose o > 0 so that 
U contains the open ball of radius o centered at wo, and ||df{ g[wo + 
B(u — u)}}""|| S ec: and ||d’f{ g[uo + B(u — uc)]}|| S L!Sy’ for some 
constant c, whenever £ & [0, 1] and || u — wol| < o. 

Choose any positive number d. For each m, let h, denote 
sup {|| e?”"gm(wo, U — Uo, B)||:8 € [0, 1], || — uol| < o}. From Parts 1.(a) 
and 1.(b) of Theorem 1, and our hypotheses, the h,, are finite, and we 
have 

P pom 
y; hn = e*cic0 + ¢c16 oy >, y yi, he, eee hi, 


m=1 m=2 l=2 kj +kot+---+k,=m 
k;>0 


for each p > 1. Since 
p-1 : 
(‘5 in) = Dain +++ fy 
m1 
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over (ki, ko, «++, kr) € {1, 2, «++, (p — 1)}/, it easily follows that 


Pp p-1 
> hr, hr, eee hi, — ah +( > hn) . 
= m=1 


m=2 l=2 ky +khot-+-+k,=m 
k;>0 


Pom 


Thus, with 
P 
Sp = > hn, 
m=1 

we have 

P 

Sp e“ci0 + C16 »> (ySp-1)' 

1=2 

for p > 1. 


Now let c > 0 be chosen such that 
= 1 
ad ¥ (ye) S=6, 
1=2 2 


and, if necessary, reduce o so that e“c,o < %c. Since s; = %4c, and 
S(p-1) = ¢ implies that s, S c for p > 1, it is clear that s, = c (and hence 
h, = c) for all p, which completes the proof. 


2.8 Comments 


Since the hypotheses of Theorem 5 can be shown to ensure the local 
existence of a Fréchet “power series” expansion (see Ref. 30, p. 190) of 
f about g(uo), another way to prove Theorem 5 is to use the result 
stated in Ref. 29. The observation concerning the representation of 
(Sp-1)’ as a sum of products used in our proof to obtain the inequality — 
involving s, and sp-; (but not the exponential weighting approach) is 
used also in Ref. 27 for a case that corresponds here to the one in 
which only a finite number of the || d’f|| do not vanish. 

Theorem 1 can also be used to prove nonlocal convergence results 
when # and & are not necessarily complex spaces. For example, let 
p be the radius of any finite open ball contained in U and centered at 
Uo. Suppose that for every x € X the following is true: d”f(x) exists for 
each m, df(x)~* exists, and || df(x)~*|] S po for some constant po. Then, 
using Theorem 1, it can be shown that if the ||d”/(-) || satisfy certain 
smallness conditions on X for m = 2, the expansion described in the 
conclusion of Theorem 2 converges uniformly to g(uw) for ||u — uol| < p. 
The “smallness conditions” are met if, for example, 


sup||d”f(x)||=0 for m=M forsome M22, 
xEX 
and each nonzero 
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sup||d”f(x) || 
xEx 


with m > 1 is sufficiently small. The details and a proof will be given 
in a later paper. 


2.9 Properties 1 and 2 


We conclude this section with a proof of a proposition used in 
Section III, where attention is directed to cases in which the elements 
of Zand & are functions of a time variable ¢. The proposition is used 
to show that under certain very reasonable conditions, causality and 
time invariance (or periodicity of variation)* are properties which, 
when possessed by g, are inherited by the terms gi(Wo, U — Uo), +++, 
&p(Uo, U — Uo) in Theorem 1, and by the terms gi(W, u — wo), 
82(Uo, U — Uo), + - in Theorem 2. We first introduce some preliminaries. 

Let 2 denote a nonempty set of real numbers. For each w € Q, let 
T., and To, denote linear transformations of # and #o, respectively, 
such that || To..w|| = ||w|| for » € 2 and w € A. Let S be a subset of 
8 such that T,,S C S for w € Q. Let J denote an open interval in the 
set R' of real numbers. 

We say that a map F:J X S — & has Property 1 on S at a point 
red if 


Tow.F (r, v) = Tok (r, T..v) 
for all v € S and w € &. Finally, we say that F:J x S > & has 
Property 2 on S at a point r in J if 
To. (r, v) = F(r, Tv) 


forv€ Sandwe€éQ. 

Proposition 3: Suppose that F:J X S — & has Property 1 (respec- 
tively, Property 2) on S for each r € J, and that for an arbitrary v € 
S the derivative dF (r, v)/dr exists at each r © J. Then the map H:J 
xX S— ZB, defined by H(r, v) = dF (r, v)/dr for each r and each v, has 
Property 1 (respectively, Property 2) on S for each r & J. 

Proof: Assume initially that F has Property 1. Let arbitrary r € J and 
v € S be given and let £ be a real variable. Using 


lim||B “LF (r + 8, v) — F(r, v)] — Hr, v) || = 0, 


we have 


lim ||8 [Tow (r + B, v) — TouF(r, v)] — To.H(r, v)||=0 (18) 


* See Section 3.1 for the pertinent definitions. 
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for any w € Q. Since (13) holds also with v replaced with T.,v, and 
using the hypotheses that F has Property 1 on S at r and at (r + 8) for 
sufficiently small 8, we find that 


lim|| A(B) + TooH(r, v) — TouH(r, T.v)|| = 0, 


in which A(B) = B-[To.F (r + B, v) — To.F (r, v)] — TowH(r, v). By 
(13), ||A(B)|| —~ 0 as B — O. Therefore, we have To,H(r, v) = 
To.fi(r, T..v) for arbitrary w € Q, as claimed. The Property 2 part of 
the proposition can be proved in essentially the same way. 


lll. APPLICATIONS AND EXAMPLES 


Throughout this section, we consider cases where each element of 
Z, and also of Z, is a function of a time variable ¢. Specifically, we 
now assume that each element of # is a map from a set T of real 
numbers into a linear space V with zero element Oy, and, similarly, 
that the elements of Z are maps from T into a linear space Vo with 
zero element Oy. 

We shall be concerned mainly with the cases where either Z = 
Bo =L.(R) or # = ZB = L.(C), in which by L..(R) [respectively, 
L..(C)] we mean the real (respectively, complex) Banach space of 
(Lebesgue) measurable* real (respectively complex) column n-vector 
valued functions v defined on the interval [0, ©) such that the jth 
component u; of v satisfies 


sup|uj(t)|< 0 for j=1,2,---,n, 
t>0 


and where the norm || - || on Z..(R) or L..(C) is given by 
vl] = max sup| (2) . 
J 
(As usual, n denotes an arbitrary positive integer.) If, for example, 
B= ZB = L.(R), then T = [0, &) and we can take V and Vo to be R”. 
3.1 Causality and time-invariance 


Referring to Proposition 3 and the associated definitions, let Q = T, 
and initially let T., (respectively, T>.,) be the “time-truncation” oper- 
ator defined on & (respectively, Zo) by (T.,v)(t) = v(t) for t < w, and 
(T.,v)(t) = Oy for > w (respectively, by (7>,.v)(¢) = v(t) for tS w, and 
(To..v)(t) = Ov, for t > w) for each v and each w. Assume that for w € 
T, T., and To,, map # and & into themselves, and that || To.v|| = || v|| 


* See, for example, Ref. 35. 
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for w € T and all v. [Notice that these assumptions are satisfied if, for 
example, @ = 4 = L.(R) or B= A = L..(C).]* 

Suppose that the hypotheses of Theorem 1 are met, that U is an 
open ball centered at uo = 9, that T,,U C U for w € T [which is clearly 
satisfied if @ = L..(R) or L..(C)] and initially assume that g is causal 
on U in the sense that To,,g(v) = Tou. g(T..v) for any v€ Uandwe T. 

Let G,,(u) denote gm(9, u) of Theorem 1 for m = 1, 2, ---, p and 
u € U. We observe that g(Bu) is an element of A for each B € (—1, 1) 
and each u € U. By the proof of Theorem 1, d”g(-) exists on U for 
m = 1, 2, ---, p, from which it follows that d”g(Bu)/dB” exists for 
BE (-1, 1), u € U, and m = 1, 2, --- , p [81, p. 198]. By the proof of 
Theorem 1, m!G,,(u) = d"g(Bu)/dB” at B = 0 for each m and u. 
Therefore, by the Property 1 part of Proposition 3 [with S = U and 
J = (—1, 1)] and an obvious inductive argument, it follows that each 
Gn:U — @ is causal in the same sense that g is causal.’ 

Now suppose that T is one of the four sets [0, ©), (—00, ©), 
{0, 1, 2, ---}, or {0, +1, +2, ---}. Again take Q = T. Let T., (respectively, 
To..) denote the “time delay operator” defined by (7T.,v)(¢) = Ov for 
t<w and (T.,v)(¢) = v(t — w) for t = w when either T = [0, ©) or T= 
{0, 1, 2, ---}, and by (T.v)(t) = v(t — w) when T = (—, 0) or T = 
{0, +1, +2, ---} (respectively, (T.,v)(¢) =@v, for t << w and (7,,v) (¢) = 
u(t — w) for t = w if T is either [0, ©) or {0, 1, 2, ---}, and (T,,v)(¢) = 
v(t — w) when T is either (—, ) or {0, +1, +2, ---}). Assume here, 
as above, that T,, and To,, map # and A, respectively, into themselves, 
that || T.,v|| s ||v|| for each v and w, that the hypotheses of Theorem 
1 are met, that U is an open ball centered at 0, and that uo = 6. 
Consider the case in which g is causal on U, and g maps the zero 
element of # into the zero element of #4. Assume that g is tume 
invariant on U in the sense that To, g(u) = g(T..u) for u€ U and 
w€T. Let Gn be as defined in the preceding paragraph. By the 
Property 2 part of Proposition 3, and the observations concerning 
d™g(Bu)/dB” in the preceding paragraph, we see that each Gn, (m = 
1, 2, -»-,p) is time invariant on U. 

The material just described can be modified to address the case in 
which g is periodically varying with a given period +. Specifically, 
suppose that T is [0, 0), (—00, 00), {0, 1, 2, ---}, or {0, +1, +2, ---}, and 
that 7 is a positive element of T. Let T., and To,, be as defined in the 
preceding paragraph, but with 2 taken to be the single-element set 
{7} rather than T. Then, in the setting described in the preceding 


[0 bs 7 et a are not met for # the set of bounded continuous functions from 
, 0) to R’. 

* Our definition is consistent with the one introduced in Ref. 36, p. 888, concerning 
causality for operators between abstract spaces. Also, a related result is given in Ref. 37, 
p. 40, for polynomial operators. 
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paragraph, T)..g(u) = g(T.,u) for u € U and w € & means that g is 
periodically varying with period t on U, and we see that if g has this 
property, it is inherited by the Gn. 

In the case of Theorem 2, each d™g(-) exists on U (because H.2 
holds), and gn(0, u) = (m!)~'d”"g(Bu)/dB” at B = 0 for each m = 
1, 2, +--+ and ||u|| < p. Therefore, results essentially the same as those 
developed in the preceding four paragraphs hold also with regard to 
the terms in (9).* 


3.2 An application of theorem 1 


Our first example, as well as the example in Section 3.3, concerns a 
nonlinear integral equation that plays an important role in the theory 
of feedback systems. To introduce the equation, we need the following 
definitions. 

Let ao and a; be positive numbers with ao S aj, and let Wi, Yo, --- , 
vn be a collection of (p + 1)-times continuously differentiable functions 
from R’ onto R! such that ¥;(0) = 0 and ao = d¥j(A)/dA S q for all i 
and all A. Below, for convenience, we shall use y{” to denote the mth 
derivative of y;. Let Y denote the map from R” into R” defined by 
[W(s)]: = Yi(s;) for t = 1, 2, ,n and all s € R’, in which [y(s) ]; and 
sj are the ith components of Ue) and s, respectively. 

Let & denote an n X n matrix-valued function defined on [0, ©) such 
that each k;; is measurable, bounded, and satisfies 


| | kis(t)|dr < 0, 
0 


In this and the following section, each &;; is assumed to be real 
valued. 
Consider the equation 


x(t) + i R(t — 7) x(r) ]dr = u(t), t=0, (14) 
0 
as well as the related equation 
y(t) + ( k(t — r)D(t)y(17) dr = v(t), t>0, (15) 
0 


in which wu and v are elements of L..(R), and D is a real n X n diagonal- 
matrix-valued function defined on [0, ») such that each D;; is measur- 
able on [0, 0) and satisfies ao = D,i(r) = a1 for each r. Since y satisfies 


* Specifically, the development remains valid if U is taken to be {u € &:|u|| < p}, 
and if “Theorern 1” and “m = 1, 2, ---, p”, respectively, are replaced with “Theorem 2” 
and “1, 2, ++. 


176 THE BELL SYSTEM TECHNICAL JOURNAL, FEBRUARY 1982 


a global Lipschitz condition on R”, and & is as indicated, a standard 
successive-approximations approach (see, in particular, Ref. 38, Sec- 
tion 1.13) can be used to show that for each uw and v in L..(R), (14) and 
(15) have solutions x and y, respectively, in the space E of functions w 
from [0, ©) into R” such that 


A 
| | wi(7) ?dt < 00 
0 


for each A € (0, 0) and each 1, and that x and y are unique in E.* To 
fix ideas, assume (this is often very easy to justify) that the only 
solutions of (14) of interest to us are those that are contained in E. 

Let A.1 denote the hypothesis that for each u and v in L..(R), L.(R) 
contains any solution x of (14) in E, as well as any solution y of (15) in 
E. For explicit conditions on k, a, and a, under which A.1 holds, see 
Ref. 36, Theorem 3. 

Assume initially that A.1 is met, and notice that then for each pair 
of elements u, v € L..(R), the space L..(R) contains exactly one element 
x such that (14) is satisfied and exactly one element y such that (15) is 
met. In particular, we see that we can take f in Section 2.1 to be the 
map of L..(R) into itself defined by 


t 
f(w)(t) = w(t) + | k(t — r)Y[w(r) |dr, t=0 

0 
for each w € L..(R), and can take 4, ZB, X, and U to be L..(R). In this 
example, g is defined on all of Z. 

To discuss the example in more detail, it is convenient to let K and 
denote the maps of L..(R) into L..(R) defined by 
t 
(Kw) (t) = | R(t — t)w(r)dr, t=0 


0 
(Ww)(t) = Yw(lt)], t20 


for each w € L..(R). Thus, here f= J + KY in which J is the identity 
map on L..(R). Consider WV. 

Proposition 4: d¥ exists on L..(R), and for w and h in L..(R), we have 
[da¥(w)h](t) = Do(t)A(t) for t = 0 in which Dot) is the diagonal 
matrix diag[¥\?[wi(t)], YL we(t)], «++ , WO [wa(t)]}. 


* The integral on the left side of (14) can easily be shown to be an element of R” for 
each ¢ whenever x € E. Since the value of the integral is unchanged if x is replaced by 
any element of E that agrees with x almost everywhere, (14) has a solution if there is an 
element of E that satisfies the equation almost everywhere, and, moreover, any solution 
x € Eis unique and not merely essentially unique. Similar remarks apply in the case of 


(15). 
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Proof: For w and h in L..(R), 
V(w + h)(t) — ¥(w) (Et) 


= Do(é)h(t) + | [Dog(t) — Do(t)]dB-h(t), t20, (16) 
0 


in which Dog(t) is the diagonal matrix of order n whose ith diagonal 
element is ¥{?[A[wi(t) + hi(t)] + (1 — B)wi(t)]. Since each ¥!” is uni- 
formly continuous on compact subsets of R’, we see that for any fixed 
w, the difference (Dogi — Doi) of ith diagonal elements satisfies 


sup | Dogi(t) — Doi(t)| > 0 as ||h|]— 0. 


Thus, with 6 € L..(R) defined by d(¢) = f5[Dog(t) — Do(t)]dB-A(t) for 
t = 0, we see that || 4|| = o(||/||), which proves the proposition. 

Proposition 4, together with the discussion above concerning (14) 
and (15), show that df exists, that (using the linearity of K) df = 
I + Kd¥(w) for each w, and that for any w the map df(w) is an 
invertible map of L..(R) onto itself. 

To make further progress, we need the following result which is 
more general than Proposition 4. 


Proposition 5: For each m = 1, 2, «++, (p + 1), d”¥ exists and is 


continuous on L..(R), and, with hi, ho, +++, hm any m elements of 
L..(R), we have 


[a (w)hy «++ inl): = Vwilt)] TL Ault), t= 0 
j=1 


for eachi=1,2,---,nandanyw€EL.(R). 
Proof: Let w be given. By Proposition 4 and, (with regard to 
continuity) the observation that ||\d¥(w + v)h — d¥(w)hA|| s 
max; suproo|¥!”[w,(t) + vi(t)] — Yf[wi(t)]| for ||A|| = 1, the assertion 
for m = 1 is true. Suppose that the assertion is true for m such that 
ls=mslwithl<(p+1). 

Let Q,(w) denote the continuous multilinear mapping of L..(R)“*” 
into L..(R) defined by 


+1 


[Qi(w) (Pr, pa, +++ Pe) EY] = YP [welt)] TT pit), = 0 


for each i and for p; € L..(R) for all 7. We shall use Q,(w) to denote the 
usual associate (Ref. 31, p. 318) of (ay that belongs to L(L..(R), L(L.(R), 
---, L(L.(R), L..(R))---)) with (J + 1) L’s, in which L(Ai, Az) stands 
for the set of continuous linear operators from the Banach space A 
into the Banach space A2.* 


* For example, if / = 2, L(L..(R), L(L.(R), ---, L(L.(R), La(R))-+-)) = L(L.(R), 
L(L..(R), L(L.(R), Le(R)))). 
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By our induction hypothesis, and with fA, as well as h; for 7 = 
1, 2, --- , d, elements of L..(R), 


|a’'h(w + h)hi +++ hi — d’¥(w)hy «+» Ai — QAw)hhy «++ hil 


Ll l 
= max sup MTwi(t) + hi(t)] I hy(t) — ¥f?[wit)] IT h(t) 


l 
— WP Lwilt) rit) T] Ajit) | S c(h) II |Z], 


where 
c(h) = max sup |W? [wit) + Ai(t)] — YP [wi(t)] — Yf?[wi(t) ri(E) |. 


Thus, ||d’¥(w + h) — d’¥(w) — Q(w)hA|| S c(h). By the uniform 
continuity on compact sets of the y{*” (see the proof of Proposition 
4), we have c(h) = o(||A||) as |||] ~ 0, which shows that dt” ¥(w) 
exists and that d“)¥(w) = Q,(w). Since ||Q(w + h)pi +++ pi — 
Qi(w)pi +++ Pisil| = 


7+1 


max sup | Y{*"[wi(t) + Ai(t)] — Yi" [wilt)]|- TT bill 
t t=0 j=1 


and the yt” are continuous, we see that 
| Quw +h) — Qi(w)||>0 as |h\| > 0, 


showing that d‘’*”¥(w) depends continuously on w. This completes 
the proof. 

Returning now to our example, by Proposition 5 and the linearity 
and boundedness of K, we see that d”f(w) exists and is continuous for 
w € L..(R) and m = 1, 2, --- , (p + 1). (Of course, d"f = Kd” for 
1<m<=(p+1).) Therefore, the hypotheses of Theorem 1 are satisfied. 
Now choose up = 8 in Theorem 1 and notice that g(@) = 6. 

Referring to the standard successive approximations technique (Ref. 
38, Section 1.13) that can be used to construct a unique solution x in 
E of (14) for each u € L..(R), by the rule by which the iterates are 
generated it follows that g is causal and time invariant on L..(R) in the 
sense of Section 3.1. Therefore, the material in Section 3.1 shows that 
8m (9, -) of Theorem 1 is both causal and time invariant on L..(R) for 
each m = 1, 2, --- , p.* 

The terms g1(0, u), g2(8, u), -++ , gp(8, u) in the expansion in Theorem 
1 can be determined using 1.(a) and 1.(b).' For example, with H 


* The same conclusion can be reached by considering the specific form of the 
&7n(9, u), and using the fact that the operator H introduced below can be shown to be 
causal and time invariant. (In this connection, see Lemma 2.) 

* The recursive process is straightforward in principle, but the complexity mounts 
rapidly with increasing order. 
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denoting [J + Kd¥(6)]~’, we have gi(0, u) = Hu, and, using g(0, u) = 
Hu, we find that when p = 2, 


£2(0, u) = —42HKd*¥(6)(Hu)* (17) 


since k; + ko = 2 with k; and k2 positive integers requires that ki = 
ko = 1, and 


g3(0, u) = —Y%HKd*¥(6)(Hu)* 
+ %HKd’¥ (6)(Hu)[HKd*¥(8)(Hu)*]. (18) 


{The derivation of (18) uses gi(0, u) = Hu, (17), and the observation 
that k, + k2 = 3 is met if either kX; = 1 and ko = 2 or k; = 2 and 
ko = 1.] 

Proposition 5 provides an important interpretation of the terms in 
the expressions for the g,,(9, u). For example, by Proposition 5, we see 
that d*¥(6)(Hu)*, which appears in the first term on the right side 
of (18), is the element s of L..(R) given by s(t) = ¥{(0)[(Hu)(t);]* 
for t = 0 and each 7. Similarly, the ith component of d**¥(#)(Hu)- 
[HKd?*(6)(Hu)*] in (18) has values ¥!?(0)[(Hu)(t):]gi(t), where q = 
HKd’¥(6)(Hu)?. Of course, g also has an immediate interpretation. 


3.3 An application of corollary 1 


Corollary 1 is in many respects a local version of Theorem 1. Here 
we give an example of an application of the corollary. As in Section 
3.2, let Z = B = L.(R), and let K, ¥, J, and wo be as defined there, but 
here it is not required that A.1 be met. 

Let F:L..(R) — L..(R) be given by F = I + KW. As in Section 3.2, 
d™F exists and is continuous on L..(R) for m = 1, 2, --- , (p + 1). Of 
course, dF'(@) = I + Kd¥(6). : 

Let z be a complex scalar variable, and let K, the Laplace transform 
of k, be given by 


K(z) = | k(t)e“dt, Re(z)=0. 
0 


Assume that 
det[1, + K(z)diag{}{?(0), --- , ¥(0)}] #0 


for Re(z) = 0, in which 1, is the identity matrix of order n. As a 
consequence, it can be shown (see Lemma 2 in Section 3.5) that 
dF(@):L..(R) — L..(R) is invertible. Thus, by Lemma 1, there are open 
subsets S; and S2 of L..(R), each containing 6, such that F restricted to 
S, is a homeomorphism of S; onto So. , 
Therefore, the hypotheses of Corollary 1 are met if X is chosen to be 
S;, fis taken to be the restriction of F to X, and U is any open convex 
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set contained in S; and containing @. This establishes the existence of, 
and shows how to obtain, a series approximation with error o(|| u||?) as 
|| || + 0 of the locally unique solution x of F'x = u, for u of sufficiently 
small norm. 


3.4 Physical systems, and an application of theorem 4 


In the following, H(C) denotes the linear space of complex column 
n-vector-valued functions h defined on [0, ©) such that, with T,, the 
“time truncation” operator of Section 3.1, we have Th € L..(C) for 
w E [0, ©) (i.e., such that any truncation of h is bounded and measur- 
able). Clearly, unlike L..(C), H(C) can contain unbounded functions. 

Consider a physical system with an input v drawn from L..(C) and 
an output w contained in H(C). Let the system be composed of linear 
elements, as well as nonlinear elements. Suppose that the nonlinear 
elements can be viewed as collectively introducing a constraint that 
can be written as y = Nx, in which N is a map from one subset of H(C) 
into another, and where x and y, respectively, are the H(C) input and 
output of the nonlinear part of the system. 

With regard to the remainder of the system, which is linear, assume 
that there are linear maps A, B, C, and D of H(C) into itself such that 


x = Av + Cy (19) 
w = Du + By. (20) 


Concerning the degree of generality of the model, and the assumption 
that the values of v, w, x, and y have the same dimension n, notice 
that we have not ruled out the possibility that some components of v, 
x, and/or y have no effect on the system, and, similarly, that certain of 
the components of w can be ignored. Nonzero initial conditions, if any, 
are assumed to be able to be taken into account either in N or as 
inputs to the system. A signal-flow-graph representation of the rela- 
tions under consideration is given in Fig. 1.* 

In this section, we use Theorem 4 to obtain a result concerning the 
response w of the system to inputs vu having sufficiently small norm. 
To state the result, we introduce the following hypotheses and defini- 
tion. 

B.1: The restrictions of A, B, C, and D to L..(C) are bounded linear 
maps of L..(C) into itself. 

B.2: There are open neighborhoods S, and S2 of @ in L..(C) such that 
N restricted to S; is an invertible map of S; onto S:. The map WN also 
satisfies N(@) = @. 


* This type of representation of a system has been used in different but related 
settings in Refs. 39, 40, and 41. The maps A, B, C, and D exist for a very large class of 
systems, but it is not difficult to give examples in which one or more map does not exist 
(see Ref. 40, pp. 244-5, for a very simple linear example along these lines). 
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(INPUT) N (OUTPUT) 


Fig. 1—Signal flow graph. Fig. 2—Feedback part of the flow graph of Fig. 1. 


Definition: When B.2 holds, the inverse of the restriction of N to S; is 
denoted by ®. (Of course, ® maps S2 onto S;, and we have ®(@) = 6.) 
B.3: There is an open neighborhood So of 6 in L.(C) such that 
d”® exists on Sp for m = 1, 2,---. 
B.4: {d®(@) — C..] is an invertible map of L..(C) onto L..(C), where C.. 
is the restriction of C to L..(C). 
Theorem 6: When B.1, B.2, B.3, and B.4 are met, there is a positive 
number 6 and a neighborhood S of 6 in L..(C) with the following 
properties: 
(¢) For each v € L..(C) with || v|| < 5, there exist unique y, x, and w 
of S, S:, and L..(C), respectively, such that (19), (20), and y = Nx hold. 
(tz) The function w described in (i) is given by 


w= Dv+ y Bgm(Av) (21) 


m=1 


for ||v|| <8, in which g,(Av) = [d®(6) — C.J Av, and 
£m(Av) = —[d®(6) — C.J" 


m 


-y (2) y d'®(6)gx.,(Av)gr,(Av) -++ gx,(Av) 
=2 ky tkhot---+k=m 


k;>0 


for m = 2, and the series on the right side of (21) converges uniformly 
for ||v|| < 6. 
Proof: Using B.4, there are open neighborhoods S and 8S; of # in L..(C), 
with S C S», such that for each p € S; there is in S a unique y with the 
property that ®(y) — C.y = p (see Lemma 1). By the boundedness of 
the restriction of A to L.(C), 5; > 0 can be chosen so that Au € S3 
when uv € L..(C) and || v|| < 61, and thus so that for each such v there 
is a unique y € S such that ®(y) — C.y = Av. For each such v and its 
associate y, let x = ®(y) and w = Du + By. Observe that for || v|| < 4; 
the corresponding triple (y, x, w) has the properties indicated in (7). 
Now assume, as we may Without loss of generality, that S3 in the 
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preceding paragraph is convex, and that So of B.3 satisfies S C So. 
Choose %, Bo, X, U, and f, respectively, of Section 2.1 to be L..(C), 
L..(C), S, S3, and the map defined by f(s) = ®(s) — C.s for s € S. We 
have df = d® — C, on X, d"f = d”® for m = 2, 3, --- on X, and, by 
B.4, df(@) is an invertible map. Thus, by Theorem 4, there is a o > 0 
such that S; contains an open ball centered at @ of radius o, and the 
solution s € S of ®(s) — C..s = p for p € L..(C) with || p|| < o is given 
by the uniformly convergent series 


d Em(P), 
in which gi( p) = [d®(@) — C..]"'p, and where 


&m(p) = —[d&(A) — C.J” 2s (1) al Di bes d'®(6)gx,(p) +++ 8x P) 


kj>0 


for m = 2. Therefore, (ii) of Theorem 6 holds if for some 6 € (0, 61) we 
have || Av|| < o whenever || v|| < 6, and this condition is met because by 
B.1 the restriction of A to L..(C) is bounded. [Notice that here, and 
with regard to the uniform convergence of the series, we also use the 
boundedness of the restriction of B to L..(C).]} 


3.5 Volterra-series representations 


In this, our final section, we first consider the single-input case in 
which v in Theorem 6 satisfies [u(¢)]; = 0 for ¢= 0 and i> 1. We give 
a theorem which provides explicit conditions on A, B, C, D, and N 
under which the hypotheses of Theorem 6 are met and the series for 
w can be interpreted as a Volterra series in v; the function on [0, ©) 
whose values are [v(t)]:. Towards the end of the section, an n-input 
extension of our result is given. 

We will use the following definitions and hypotheses. 

Definition: For any positive integers / and q, let S{? denote the set of 
all functions A from the /-dimensional interval [0, 0)’ into the set of 
complex n X g matrices such that each h;; is measurable and bounded 
on [0, ~)’, and satisfies 


| | hij(71, T2, °° *% 4 TI) |d(7, T2, ***%, 1) < ©, (22) 
[0,00)é 


Definition: If r and s are two complex column n-vectors, then rs 
denotes the column n-vector defined by (rs); = ris; for i = 1, 2, +--+, n. 
Definition: “’L..(C) denotes L..(C) with n = 1. 

C.1: Referring to A, B, C, and D of (19) and (20), there are elements 
a, b, c, and d of S® such that for each p € L..(C), 
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t 


(Ap)(t) = | a(t — t)p(r)dr 
0 


(Bp)(t) = i b(t — 7)p(7)dr 
0 


(Cp) (t) = [ c(t — t)p(r)dr 
0 


(Dp) (t) = d(t — t)p(t)dr 
0 

for t= 0. 

C.2: For some y > 0, N of Section 3.4 is defined on T = {s € L..(C): 

Il sll < y} by 


[(Ns)(¢)]; = nA[s¢)]}, t20 (23) 


for 7 = 1, 2, ---, n, in which each n; maps complex numbers z with 
|z| < y into complex numbers such that n,(0) = 0 and such that 
dn,(z)/dz exists for | z| < y and is nonzero at z = 0. (Clearly, when C.2 
is met, N restricted to can be represented by n single-input single- 
output memoryless nonlinear operators.) 
C.3: det{l, — diag[dn(0)/dz, --- , dn,(0)/dz] ff c(r)e dr} ¥ 0 for 
Re(z) = 0.* 

A result (see Lemma 3, below) concerning elements of S{? that we 
shall use is the following: 


Proposition 6: If ki€ S? for some l, then the iterated integral 
t t 

| vee i kilt — 11, t — T2, +++ , & — 71) (ti)e(te) - ++ w(ti)dtidt2 «++ dry 
0 0 


exists for t= 0 and p € ‘YL..(C), and V;,(u) defined on [0, ) by 


Vi, (u) (2) -| vee | kilt — 11, t — 72, +++ , £ — 71) (71) w (72) 
0 0 


s+ w(t1)dtidt2 +++ dt; 


for an arbitrary p € L..(C), is an element of L..(C). 
Theorem 7: Suppose that C.1, C.2, and C.3 are met. Then 

(t) The hypotheses of Theorem 6 are satisfied. 

(ii) For each | = 1, 2, --- there is ak; ES? such that, under the 
condition that [v(t)]; = 0 fort = 0 andi > 1, we have 


* As in Section 3.3, 1, denotes the identity matrix of order n. 
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w= Y Vilvi1) for |lv|| <4, 
i=l 


with the series uniformly convergent for ||v|| < 6, where v, w, and 6 
are described in Theorem 6, and V,, (+) is as indicated in Proposition 
6. 

(iii) Each k; can be taken to be continuous* on [0, )' when a and 
d are continuous on [0, ©). 
Proof of Theorem 7: Hypothesis B.1 is clearly met. Also, with N as 
described in C.2, an easy modification of Proposition 4 shows’ that 
dN:L..(C) — L.(C) exists on I’, and that for p € Tl andh € L..(C), 


[dN (p)h](t)= D(t)h(t), t20, (24) 


where D(t) = diag{dn[ pi(t)]/dz, --- , dn,[ pa(t)]/dz}. 

Similarly, an easy modification of Proposition 5 establishes that 
d™N(p) exists for p € T and all m. (Observe that, because z is complex, 
the existence of dn,(z)/dz for each j and |z| < y means that 
the derivatives of each n; of all orders exist for |z| < y.) Since each 
dn,(z)/dz is not zero at z = 0, and hence each is nonzero throughout 
a neighborhood of z = 0, it is clear that the | dn;(z)/dz| are bounded 
away from zero on some neighborhood of z = 0. It follows from (24) 
that dN (p) is invertible for p in a neighborhood of 8 in L..(C). Thus, 
there are open neighborhoods S; and S2 of 6 such that N restricted to 
S;, which we denote by No, is an invertible map of S; onto S, and 
d™(No') exists throughout S2 for each m = 1, 2, --+ (see Lemma 1). 
Therefore, B.2 and B.3 are satisfied. 

Let Y:S, — S; denote No’, and notice that satisfies 


[((¥s)(t)]; = n7*{[s(t)])}, t20 


for each j and all s in some neighborhood of 6, where n;’, defined in 
a neighborhood of the origin of the complex plane, is a local inverse of 
nj. Since each dn;(z)/dz exists throughout a neighborhood of the 
origin, and does not vanish at the origin, we see that for each j and m, 
d”n;\(z)/dz™ exists in a neighborhood of the origin. Therefore, by a 
direct modification of Proposition 5, 


[d™Y(O)hy +++ hin(t) = d"nj*(0)/d2™ TT halt) (25) 


for t = 0, each m and 1, and any fy, he, --+ , Ay in L..(C). 
We shall use (25) subsequently. At the moment, concerning ¥, 


* Of course, by k; “continuous” we mean that each component of &; is continuous. 

t Notice that if dnj(z(p;(t) + Ajt)) + (1 — z) p,(t)]/dz exists at a point (a, 0), then 
dn,;[ B( p(t) + A,(t)) + (1 — B)p,(t)], with B a real variable, exists at 8 = a and the values 
of the two derivatives are the same. 
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note merely that [d¥(#)]"' = dN(@). Since [d¥(@) -— C.] = 
d¥(0){I — [d¥(0)]"'C.}, where I is the identity transformation in 
L.(C) and C,, is defined in B.4, by C.3 and Lemma 2 below, we see 
that the hypotheses of Theorem 6 are satisfied. 

In Lemma 2 we refer to the following. 
D.1: ) € S®, A denotes the map of L..(C) into itself defined by 


t 


(Ap)(t) = i A(t — tT) p(r)dr, t=0 


0 


for p € L..(C), and, with z a scalar complex variable, A(z) denotes 


| A(t)e~“dt, Re(z) = 0. 
0 


Lemma 2: Let D.1 hold, and suppose that det{1, — A(z)} # 0 for 
Re(z) = 0. Then 

(t) (I — A) is an invertible map of L..(C) onto itself, 

(ii) there isax €S such that 


t 
(I — A)“'p(t) = p(t) + | K(t — tT) p(r)dr, t=0 
0 
for p © L..(C), and 
(iit) if X is continuous for t = 0, then x can be taken to be con- 
tinuous on [0, ).* 
Lemma 2 is proved in Appendix D. 
We also need the following two lemmas which are proved in Ap- 
pendices E and F. 
Lemma 3: Suppose that h € S\? for some l= 1, that s ES!, and that 
u is a bounded measurable function from [0, ©)’ into the complex 
numbers. Then 
(i) With h defined on (—«, «)' by h = h on [0, @)' and h = Oni (the 
zero n X 1 matrix) otherwise, the function k, defined by 


R(a, 2, +++, a) = i s(nh(a — T,°*+,a:—T)dt 
0 
for (a4, a2, +++ , a1) € [0, 2)’, belongs to S{?. 


(ii) If h is continuous on [0, )’, then so is k. 
(tit) The iterated integrals 


t t 
| cee [ery eee yt dln eee yd oe dr 
0 0 


* Part (iii) is not used in the proof of Theorem 7, and is included because it is useful 
for other purposes. 
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and ; ; 
i see [Ble ty oo Erie ren oo de 
0 0 


exist, and are invariant with respect to interchanges of orders of 
integration, for t = 0; and p defined by 


t t 
p= | ce [ale ry auley oe yrds dr 02 0 
0 0 


is an element of L..(C). 
(tv) We have 


t T T 
| sen | | A(t — m1, +++,7 — 7 )U(T1, «++, TIdt1 +++ dridt 
0 0 0 


t t 
-| cee [Rt ny eee yt duly oo ren os t=0. 
0 0 


Lemma 4: If h € S\”) and k © S\, then the function s, defined* on 
[0, 20)?" by 
s(n, 91 Or wg Tp+q) = h(n, eae Tp) R(Tp+1; Se Ng Tp+¢q) 


for (11, ++, Tp+q) & [0, ~)?*%, belongs to S\?*”. 
We now return to the proof of Theorem 7. 
With v, w, and 6 as in Part (ii) of Theorem 6, we have 


w= Du+ > Bgm(Av), |u|] <6 
m=1 


in which the series converges uniformly, gi(Av) = [d¥(6@) — C..]"'Av, 
and 


&m(Av) = —[d¥(9) — C.J” 


SN dO) ga,(Av) «++ ga(Av), m= 2. 
l=2 ky thot++-+h=m 


Assume now that [u(¢)]; = 0 for ¢ => 0 andi > 1. By Lemma 3, it 
suffices to show that for each m there is a gm © S{” such that 


8m(Av) (t) 


t t 
-| eee i Qn(t — 71, eee, t — Tm)U1(71) ane Ui(Tm)dT1 o++ Atm (26) 
0 0 


* See the second definition at the beginning of Section 3.5. 
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for ¢ = 0 and ||v|| < 5, and that qg,, is continuous on [0, ©)” when a is 
continuous on [0, ©). Thus, suppose for the purpose of induction that 
(26) holds with g, as indicated for m = 1, 2, --+, p for some p = 1. 
Observe that by Lemma 3 and Part (ii) of Lemma 2 the induction 
hypothesis is met for p = 1. 

We have 


Epri(Av) = —[d¥(6) — C.J” 

(p+1) 

-Y (2 Yd 4H (8)gu,(Av) «++ gx(Av). (27) 
[=2 hythyt-.++hk= pt] 
hj>0 

Now let / = 2 be fixed, and let hi, --- , k; be any / positive integers 
such that k} + kp + --- +k; = p+ 1. By Lemma 4, (25), and Lemma 
3, r defined by 


r(t1, *++, Tp+1) 
= Gz, (71, eooe 9 Th) Qk (Thy+1s eee ’ Thytha) eoe Phe AT hye. + 0+hy 441s eee .) Tp+1) 
for (71, a Tp+1) S [0, 00) ao belongs to Si; and 


d'¥ (0) g.,(Av) +++ gp(Av)(t) 
= D | tee i r(t — 71, +++, £ — Tp41)Ui(T1) © ++ Ui(Tp+i)dT1 «++ ATp+1 
0 0 


for tf = 0, in which D is the diagonal matrix of order n whose jth 
diagonal element is d’nj1(0)/dz’. Observe that r is continuous on 
[0, 0)?** when each gz, is continuous on [0, «)* 

Therefore, using (27), and by Lemmas 2 and 3, there is a gp+1 € 
S{?*), which is continuous on [0, %)?*! when a is continuous on 
[0, ©), such that (26) holds with m replaced with (p + 1). This 
completes the proof of the theorem. 


Comments: By Lemma 2 and Proposition 7 (in Appendix G), an 
interpretation of C.3 is simply that the “feedback part” of the graph of 
Fig. 1, shown in Fig. 2, is bounded-input bounded-output stable* when 
C.1 and C.2 are met, N is replaced with its linearization dN (@) extended 
in the natural way to all of H(C), and C:H(C) — H(C) is defined by 
(Cp)(t) = So c(t — r) p(t)dr, t= 0 for each p € H(C). 

The k; in Theorem 7 can be taken to be real valued (i.e., to have 


* By this we mean that for each L..(C) input to the graph of Fig. 2, there is in H(C) 
a unique output, and the output belongs to L..(C). In this case, existence and uniqueness 
of an output in H(C) follows from the hypotheses concerning c via the usual successive 
approximations approach (Ref. 38, Section 1.13). In Fig. 2, J denotes the identity 
transformation in H(C). 
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zero imaginary part) if a, b, c, and d are real valued, and dn; '(0)/dz” 
is real for all m and /. (It is not difficult to show that thed”n;*(0)/dz™ 
are real when dn,(0)/dz” is real for every m and j.) In particular, we 
see that Theorem 7 establishes the existence of a Volterra-series 
expansion for the important corresponding case in which uv; w, x, and 
y in Fig. 1 are restricted to be real valued, C.1 is met, and N (which 
then would be a map between real-valued function spaces) can be 
analytically extended so that C.2 and C.3 are satisfied.** In this 
connection, Theorem 5 can be used to prove results along the same 
lines as Theorems 6 and 7, but with L..(C) replaced throughout with 
L..(R). Similarly, Corollary 1 can be used to obtain corresponding pth 
order approximation results under weaker differentiability hypotheses 


concerning N. 
Theorem 6 provides explicit expressions for the g,,(Av). For example, 


gi(Av) = [d¥(8) — C.J Av 
£2(Av) = —4[d¥(6) — C..]1d2¥(6) {[d ¥(8) — CJ Av}? 
g3(Av) = —[d¥(8) — C.J-*d*¥ (8) g1(Av) go(Av) 

— YV[d¥ (0) — Co} 'd°*¥(A)[ gi(Av)}’, 


and so on. Therefore, assuming merely that we can write down the 
representation of [d¥(@) — C..] along the lines of Part (iz) of Lemma 
2, notice that it is not difficult in principle to give an explicit expression 
for any of the k; of Theorem 7. 

Theorem 7 (and Proposition 6) can be extended to cover the case in 
which the restriction that [v(t)]; = 0 for t= 0 and i > 1 is not met.? 
Specifically, using the results and techniques described in connection 
with the proof of Theorem 7, it is not difficult to prove the follow- 
ing extension in which for each J, x[v(71), «++, u(7:)] denotes the 
column vector of order n’ whose elements are the n’ distinct 
products v,,,(71)U.,(t2) +++ U.,(7:), corresponding to distinct sequences 
W1, We, +++ , Ww With each w; drawn from {1, 2, --- , n}, arranged in an 
arbitrary predetermined order. 


* Observe that this extendability condition is often met. (In particular, polynomial 
nonlinearities frequently arise in locally-valid models, and polynomials in z are entire 
functions.) 

tTheorem 7 bears on problems concerning the transmission of digital signals over 
analog communication channels. Discussions with this writer’s colleague, J. Salz, con- 
cerning such problems provided part of the motivation to formulate the system model 
in Section 3.4 and to develop a theorem along the lines of Theorem 7. 

* The case of more than one input is of importance, for example, in connection with 
studies of the effects of initial conditions. Also, straightforward modifications suffice to 
ae cee results for the time-discrete case in which ¢ takes values in 
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Theorem 8: Under the fypotheses of Theorem 7, for every | = 
1, 2, +++ there is a k, E S“’, with ki continuous on [0, 0)’ when a and 
d are continuous on [0, ©), such that 

(t) The iterated integral 


[ - fa hilt — ty «++ 8 T)xlv(n), «++, ved drs «+ dr 


exists fort = 0 and v € L..(C), and V;,(v) defined on [0, ») by 
Vz (v) (t) 


-| see | kit — 71, +++ ,€—72)x[v(11), «++ , v(t1) dr +++ ari 
0 0 


for any v € L..(C) is an element of L..(C). 
(i) With V;,(-) as indicated in (t) above, and with v, w, and 6 as 
described in Theorem 6, the expansion 


w= Y: Viv) 
l=1 
converges uniformly for ||v || < 6. 

It is not difficult to verify that Theorems 7 and 8 remain valid if C.1 
is modified to allow a constant multiple (or, more generally, a constant 
n X n-matrix multiple) of the identity operator on H(C) to be added 
to B. 

We do not give here the details of other extensions of Theorem 7,” 
but it is worthwhile to appreciate that in some extensions in which C.1 
is weakened, series representations can arise in which, unless general- 
ized-function kernels are admitted, the terms do not have the form 
normally associated with a Volterra series. Consider, for example, that 
if n = 1, if A, as well as B, is the identity transformation on H(C), if C 
and D have the representations on L..(C) given in C.1, and if C.2 and 
C.3 are satisfied with d?n71(0)/dz? = 2, then the hypotheses of Theo- 
rem 6 are met and the second term in the sum in (21) is a function 
whose values are 


* We leave for another paper results concerning cases in which N can be of a more 
general form, and the restrictions to L.(C) of A, B, C, and D are not necessarily time 
invariant, and generalized functions may be involved in their representations. Also left 
for a later paper are applications to differential equations. Assuming merely that A, B, 
C, and D are defined on all of H(C) as convolution operators with kernels a, 0b, c, and d, 
it is a simple corollary of Theorem 8 that there exists a Volterra-series expansion also 
for the case in which the conditions of Theorem 8 are met, with the exception that the 
dn,(z)/dz are not necessarily nonzero at z = 0. More specifically, N can be replaced 
with N plus a multiple BJ of the identity operator J on H(C), with |f| sufficiently small 
that the four linear parts of the system can be modified accordingly and remain S 
convolutions. 
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— p*u(t)? — 2p7v(t) | k(t — r)v(r)dr + p” [ k(t — r)v(r)°dr 
0 0 
+ 2p | k(t — r)v(r) [ R(t — 1)vu(1) ddr 
0 0 


t E 
+ [ | k(t-1,t- T2)U(11)U(T2)dt1d72, 
0 40 
in which p is a nonzero constant, k € S{” and ka € S?”. 


APPENDIX A 
Proof of Proposition 1 


Let o be real, nonzero, and such that (7o + o) € A. Then, using the 
linearity of L(7o + 0), 


o [L(ro + o)e(ro + «) — L(ro)e(ro)] 
= L(ro + o)o[e(ro + 0) — e(7o)] + 0 [L(7r0 + «) — L(ro) Je(ro) 
= L(ro)de(ro)/dr + [dL (ro)/drje(ro) + Ai(o) + Ac(o) + As(o) + Ag(o), 
where 
Ai(o) = L(ro) {a "[e(ro + a) — e(r0)] — de(ro)/dr]} 
Ao(o) = [L(ro + «) — L(ro)]{o"[e(ro + «) — e(ro)] — de(ro)/dr} 
A3(o) = [L(ro + o) — L(ro)|de(ro) /dr 
As(a) = {0 '[L(r0 + 6) — L(r0)] — dL (70) /dr} e(r0). 


Since L(ro) is a bounded operator and ||o7"[e(ro + o) — e(ro)] — 
de(ro)/dr|| > 0 as o— 0, Ai(c) > 0 in S as o > 0. It is clear that A(o), 
A3(a), and Ay(o) approach zero in S as o—> 0. This completes the proof. 


APPENDIX B 
Part of the Proof of Theorem 1 


Let k be any integer such that 1 <= k S p, and let v;(-), vo(-), +++, 
u,(-) denote & maps from an open subset of (—%, «) containing [0, 1] 
into Z such that each u,(-) is differentiable on [0, 1]. With Z an integer 
such that 0 < 1 < k — 1, consider d*f[ q(B)]ui(B) +++ vz-1-1(B)vz-A(B). 
Since d’*f[ q(8)] for 0 < B <= 1 is a Fréchet derivative, d’f[ q(B)]v:(B) 

- Up-1-1(8) is a bounded linear map from & into a Banach space S 
for each 6 & [0, 1]. By the version of the chain rule in Ref. 31, p. 173, 
d{d*f{ q(B)]}/dB exists for B & [0, 1], and 
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d{d*f[ q(B)}}/dB = d**"f[.9(8)]g%(8), BEl0,1]. (28) 


By Proposition1, (28), and an obvious inductive argument, 
d{d"f[ q(B)]ui(8) -++ ve-1-1(8)}/dB exists for B € [0, 1]. Thus, by 
Proposition 1, d{d*f[ q(B)]vi(B) --+ ve-i(B)}/dB exists and satisfies 


d{d*f[ q(B)]ux(B) «++ ve-(B)}/dB 
= d*f[ q(B)]ui(B) «++ ve-1-1(8)[duz-(B)/dB] 
+ d{d"f[ q(B)]ui(B) «++ Ue--1(B)}/dB ve-(B) (29) 


for B € [0, 1]. 

By relations (28) and (29), and the fact that df[q(@)]/dB = 
df[.q(B)]q"() for 0 = B <1, we see that an expression E,,(8) can be 
given for df[ q(8)]/dB” for B E [0, 1] which depends only on d’f[ ¢(8)] 
and q(B) for J = 1, 2, --», m. For example, E2(8) = df[ q(8)]lq™ (8) 
+ d’f[.q(B)Jq (B)q (8). Since df[.q(8)]/dB = (u — uo) for B € [0, 1], 
we see that E,,(B) = @ for B € [0, 1]. 

Now consider (5) and (6). Since d’f[ go(r)] and d’qo(r)/dr’ exist at 
r = B, with d'fol go(8)] = d'f[.9(8)] and d'qo(r)/dr’ = (8) for r = B 
and / = 1, 2, --- ,m, by Proposition 1 and observations similar to those 
of the preceding three paragraphs, we see that, as claimed in Section 
2.2, a” fol q(r)]/dr”™ | =, exists and that it equals £,,(6). 


APPENDIX C 
Proof of Proposition 2 


Under the conditions indicated, dg(uo) exists (and equals L). Thus, 
using f[ g(u)] = u for u € U, and with I the identity operator on %, we 
have df[ g(uo)|dg(uo) = I. This shows that df[ g(uo)] has a right 
inverse. 

On the other hand, for u € U with ||u — woll < o, g(u) — g(uo) = 


Lflg(u)] — Lflg(uo)] + RL g(u)] — FL g(uo) J} and, thus, 
{Ip — Ldf[ g(uo) ]}[ g(u) — g(uo)] = R{df[ g(uo) IL g(u) — g(uo)] 
+ Ril g(u) — g(uo)]} + Ref g(u) — g(uo)] (30) 


in which Ip is the identity operator on A, Ri(h) = o(||A||) as ||A]] > 0, 
and, using the boundedness of L, Ro(h) = o(||h||) as ||h|| > 0. 

Now let v € & be arbitrary. By the continuity of f at g(uo), and the 
openness of X and U, choose 8 > 0 so that g(uo) + av € X, 
fle) + av] — f[_g(uo)]|| < 6, and f[g(uo) + av] € U for |a| < 
B. Notice that for each a with |a| < 8, and with u = f[ g(uo) + av], 
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we have || uz — wo|| < a, as well as f[ g(u)] = f[ g(uo) + av], and hence* 
g(u) = g(Uo) + av. 
Therefore, by (30), 


{Io — Ldf[ g(uo)]}v = a" R{df[ g(uo) av + Ri(av)} + a" Ri(av) (31) 


for 0 <|a|< . Since the right side of (31) approaches @ as a > 0, and 
since v is arbitrary, it follows that df[ g(uo)] has a left inverse. Since 
df[ g(uo)] has both a left inverse and a right inverse, df[ g(uo)]~’ exists. 
Finally, a standard type of argument shows that (df)~’ exists and is 
continuous throughout some neighborhood of g(uo) when f is contin- 
uously differentiable on a neighborhood of g(uo) (see the proof of 
Corollary 1 and the references given there). 


APPENDIX D 
Proof of Lemma 2 


In the following, we use L; to denote the set of complex-valued 
functions summable over [0, ©). 

For Re(z) = 0, we have [1n — A(z)]}7 = M(z) {det[1, = A(z)]}7?, in 
which M is the matrix of transposed cofactors of [1, — A(z)] ifn >1 
and M = 1 if n = 1. Since A, € ZL; for each i and j, and the convolution 
of any two bounded L; functions belongs to LZ; and is bounded, it 
follows that there is a g € Li, and anr € S®, such that 

det[1, — A(z)] =1—- i q(t)e “dt 
10) 
M(z)A(z) = | r(t)e~“dt 
0 
for Re(z) = 0. . 

Since det{1, — A(z)] ¥ 0 for Re(z) = 0, it follows (see Ref. 42, pp. 

60-63) that there is an element s of Z; such that 


{det[1, — A(z)}}7 =1+ il s(r)e “dr, Re(z) = 0.7 
0 
Thus, « defined by 
k(t) = r(t) + | r(t — r)s(r)dr, t=0 


0 


* Here we use the hypothesis that for each u € U, there is exactly one x € X such 
that f(x) = u. 7 

ones that when A(z) is rational in z, it is a simple matter to show the existence of 
such an s. 
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belongs to S{”, and H(z) given by 
H(z) = i k(t)e~*"dr, Re(z) =0 
0 


satisfies H(z) = [1, — A(z)] *A(z) for all Re(z)=0. | 
Since [1, — A(z)]’A(z) = A(z) + A(z)[In — A(z)] A(z) for 
Re(z) = 0, we have 
t 
x(t) = A(t) + | A(r)K(t — t)dr (32) 
0 
for almost every ¢t = 0.* 
For an arbitrary p € L..(C), let g € L..(C) be defined by 


t 
q(t) = p(t) + : k(t — 7)p(r)dr, t=0. (33) 
0 
We have for t = 0, 


t t 


K(t — t)p(t)dt — i A(t — t)p(t)dr 


0 


q(t) — (Aq) (t) = p(t) + i 


0 


-| ne=n) | K(T — 1)p(12)dr2d71. 
0 0 


Since 


t T 
i A(t — 71) | K(T1 — T2)p(T2)dt2d7 
0 


0 


can be expressed as 


t t-B 
| i A(a)«(t — B — a)p(B)dadB 
0 40 


for ¢ = 0 [the justification of the interchange of order of integration 
being provided by Theorems of Fubini and Tonelli (Ref. 43, pp. 137- 
45)], we have, using (32), (I — A)q = p. Thus, (J — A) maps L..(C) onto 
itself. Similarly, (32) holds with A and « interchanged in the integral, 
and (32) so modified can be used to show (see the proof of Theorem I 
of Ref. 39) that whenever there is a solution g € L..(C) of (J — A)g = 
p with p € L..(C), then (33) holds. This establishes Parts (z) and (iz) 
of the lemma. 

Suppose now that A is continuous on [0, 0). Since x € S, by Part 


* Equation (32) is a matrix-valued-function version of the usual equation for the 
resolvent kernel. 
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(it) of Lemma 3, the integral in (32) depends continuously on ¢ for 
t = 0. Thus, by (32), the values of « agree almost everywhere on 
[0, 0) with those of a continuous function. This completes the proof of 
the lemma. 


APPENDIX E 
Proof of Lemma 3 


Consider k. That each kj: is measurable on [0, 0)’ and satisfies 


i |Ri(t1, +++, 71) |d(T1, +++, 77) < © 
[0,00)? 


follows from a direct application of Theorems of Fubini and Tonelli 
(Ref. 43, pp. 1387-45). (See the proof in Ref. 44, pp. 99-100, for the 7 = 
1 case.) Since every s;; is summable over [0, ©), and h is bounded, we 
see that & is bounded. Thus, (z) holds. 

Suppose now that A is continuous on [0, «)’. Let am = 0 be given for 
m=1,2,---,J, let 6, for m = 1, 2, --- , 1 be real variables such that 
each (am + dn) is nonnegative, and let A be defined by 


foo] 


A(a + 8) -| si (tT) Ajilon +6,—7,-++,a:+ 6:—17)dt 
0 


for any i and j. Let y > 0 be given. With 0, and 62 such that 
\Aji(m, «++, T2)| S db: and |sy(r)| S be for (1, -+» , 72) € [0, )’ and 
t € [0, ©), choose 79 € (0, ©) so that 


> 1 
2b | |su(t)|dr => 7, 
and observe that 
| A(a + 5) — A(a) | 
1 anes 
<py+b| |Aji(ar + 61 — 7, «++, ar + 8;—7) 
0 


_ Ay(a1 — 7, Nope , a1 — T)| dr. (34) 


Since y is arbitrary, and, by the boundedness and uniform continuity 
of h on compact subsets of [0, )’, the value of the integral in (34) can 
be made arbitrarily small by choosing 


Z 
y [dm| 
m=1 


to be sufficiently small, we see that & is continuous on [0, %)', which 
proves (iz). 
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Since A(t — 7, «++, ¢— 7)u(n, +++, 71) and R(t — 1, ---,t—- 
71) (71, +++ , 7%) are bounded and measurable on (7, --- , 7) & [0, t]’, 
the multiple integrals 


| h(t—m,-++,t—7)u(t, -+-, T:)d(t1, -°*, T1) 
[0,2 
and 


[ R(t —1, +++ ,t—71)u(ti, -*+ , t2)d (71, -** , T1) 
[0,¢]! 


exist. Therefore, two repeated [i.e., two (J — 1)-fold] applications of 
Fubini’s theorem (Ref. 43, p. 1387) show that the iterated integrals in 
(tit) exist, that each equals the corresponding multiple integral, and 
that each is invariant under changes in the order of integration. Notice 
that the existence of 


[|| cee [Rtn eee teddy 22rd + de (35) 
0 0 0 


for any ¢t > 0 can be established in essentially the same way. 
Now let p be defined on [0, ~) by 


po-| oe [Remy tm 
0 0 


-U(T1, +++, T1)dT1 +++ Ati, t=0. (36) 


Since h € S{?, and u is bounded on [0, ~)’, it is clear from the 
relationship between the iterated integral in (36) and the corresponding 
multiple integral that p is bounded on [0, ©). That p is measurable on 
[0, ©), is a consequence of the existence of (35) for all t > 0. 

Similarly, again using Fubini’s theorem and the fact that a bounded 
measurable function defined on a set E of finite measure is summable 
over E, we have, for any ¢ = 0, 


t T T 
[ «-» | cee [Rem ty ee teal oo+ yids +++ ded 
0 0 0 


t t—r t—r 
= | se | a | A(t-—TtT—m1, +-++,t-7—T1) 
0 0 0 


-U(T1, © +, TIT, -++ Atidt 


t 
= | se A(é—7 — 1, «6+ ,t-7 — 7) 
0 [0,¢]! 
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-u(T1, °°* , TAT, +++ Atidt 


t 
-| [ stofte-r- ryt = mar 
0.1! Jo 


-u(n1, eee t1)d (71, ve 71) 


-| k(t—7, +++ ,¢—7)u(n, ++, 71)d(t1, +++, T1) 
[0,t}! 


t t 
-| eee | k(t — 71, coe ,t—71)u(n1, eee , Tat, eee dri, 
0 0 
showing that (iv) is met. This completes the proof of the lemma. 


APPENDIX F 
Proof of Lemma 4 
By Fubini’s theorem (Ref. 43, p. 137), and the proposition that 


bounded measurable functions on a set E£ of finite measure are sum- 
mable on E, 


| | Aix (73, Lue8 Tp) Ra (Tp+1, ey Tp+q) |d (nr, oO ty Tp+q) 
(0,7 ](P+9) 
=| |Ai(t1, +++, T)|a(n, 7+, Tp) 
[0,co) P 


7 | Ria (Tp+1, ne) Tota) | d(tp+1, eng Tp+q) 
[0,c0) 7 


for each i and any finite T > 0, from which it is clear that the lemma 
holds. 


APPENDIX G 

On the Necessity of the Condition That Det[1, — A (z)] #0 for Re(z)=0O 
Proposition 7: Let D.1 (which appears just before Lemma 2) hold. If 
for each gq € L..(C) there is a p € L..(C) such that (I — A)p = q, then 
det[1, — A(z)] ¥ 0 for Re(z) = 0. 

Proof. 

Since \ € S{, by a standard successive apprdximations approach 
(Ref. 38, Section 1.13), it can be shown that there is an n X n matrix- 
valued function x defined on [0, ©) such that each kj is square 
summable on any finite interval [0, 8], and 


t 


k(t) = A(t) + | A(r)k(t — r)dz, t=0 (37) 


0 


EXPANSIONS 197 


(i.e., and such that (32) is met for ¢ = 0). From (37) and the Schwarz 
inequality, the «x; are bounded on finite intervals. Using Fubini’s 
theorem (see the proof of Theorem I of Ref. 39), if (J — A) p = q with 
g and p in L..(C), then 


t 


p(t) = q(t) + | K(t — r)q(r)dr, t= 0. 


0 


Thus, by the hypothesis of the proposition, each kj is summable on 
[0, 0). In particular, the Laplace transform H(z) of x, given by 


A(z) = i] x(t)e~“dt 


0 


exists for all Re(z) = 0. 

We have (J — A)(I + H)q = q for every g € L..(C), in which H is the 
convolution transformation defined in L..(C) in the usual way in terms 
of x. Now let g be given by q(t) = e~‘c; for t = 0, in which ¢; is the 
column n-vector whose ith component is unity and all other compo- 
nents are zero. Upon taking the Laplace transform of both sides 
of (I — A)(I + H)q = q, we find that [1, — A(z) {dn + A(z) ci = C; 
for Re(z) = 0 and each i. Therefore, [1, — A(z)][In + H(z)] = ln 
for Re(z) = 0, which shows that det[1, — A(z)] ¥ 0 for Re(z) = 0. 
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Recent results show for the first time the existence of a locally 
convergent Volterra-series representation for the input-output rela- 
tion of a certain important large class of time-invariant nonlinear 
systems containing an arbitrary finite number of nonlinear elements. 
(Systems of the type considered arise, for example, in the area of 
communication channel modeling.) Here corresponding results are 
given for time-varying systems, which arise frequently. A key hypoth- 
esis of our main theorem, which asserts that a convergent Volterra 
expansion exists under certain specified conditions, has the useful 
property that it is met if a certain “linearized subgraph” of the system 
is bounded-input bounded-output stable. 


|. INTRODUCTION 


This paper is a continuation of the study initiated in Ref. 1 concern- 
ing operator-type models of dynamic nonlinear physical systems, such 
as communication channels and control systems. Reference 1 addresses 
the problem of determining conditions under which there exists a 
power-series-like expansion, or a polynomial-type approximation, for 
a system’s outputs in terms of its inputs. Related problems concerning 
properties of the expansions are also considered, and nonlocal as well 
as local results are presented. In particular, the results in Ref. 1 show 
for the first time the existence of a locally convergent Volterra-series 
representation for the input-output relation of a certain important 
large class of time-invariant systems containing an arbitrary finite 
number of nonlinear elements. 

The main purpose of this paper is to give corresponding results 
applicable to time-varying systems, which arise frequently. Also, the 
results obtained here by specializing to the time-invariant case involve 
weaker hypotheses concerning the nonlinear elements (here mutual 
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coupling is not ruled out, and, at the expense of somewhat more 
complicated proofs, we show how to proceed without the local inver- 
tibility of a certain mapping associated with the nonlinear elements”). 

With regard to background material, functional power series of the 
form 


33 b b 
kot+ ¥ | vee | Rm(t, 71, +++, Tm)U(Ti) +++ Ultm)dt1 +++ dtm, (1) 
m=] Jq a 


in which & is a constant, ¢ is a parameter, and u and the k,, for m = 1 
are continuous functions, were considered in 1887 by Vito Volterra” ° 
in connection with his studies of functions of functions (which provided 
much of the initial motivation to develop the field now known as 
functional analysis). About twenty years later, Fréchet* proved that a 
continuous real functional (i.e., a continuous real scalar-valued map) 
defined on a compact set of real continuous functions on [a, b] could 
be approximated by a sum of a finite number of terms in Volterra’s 
series (1), but with (in analogy with the well-known Weierstrass 
approximation theorem) the number of terms as well as the k,, de- 
pendent on the degree of approximation. Further background material 
(concerning, in particular, bilinear and polynomic systems) omitted 
here to avoid unnecessary repetition, can be found in Ref. 1. 

Our results are given in the next section, which begins with some 
mathematical preliminaries followed by a description of the general 
class of systems to be addressed. Of interest with regard to our main 
result, Theorem 2 below, is that a key hypothesis has the useful 
property that it is met if a certain “linearized subgraph” of the system 
is bounded-input bounded-output stable. 


ll. SYSTEMS AND EXPANSIONS 
2.1 Preliminaries 


Throughout Section II we use L..(C) to denote the complex Banach 
space of Lebesque measurable complex column n-vector-valued func- 
tions v defined on the interval [0, ©) such that the 7th component u; of 
v satisfies sup;>o| u;(t)| < © for 7 = 1, 2, ---,, and where the norm 
|| «|| on Z..(C) is given by || v|] = max; sup,| v;(¢) |. (As usual, m denotes an 
arbitrary positive integer.) The symbol @ stands for the zero element 
of L..(C). We use H(C) to denote the linear space of complex column 
n-vector-valued functions h defined on [0, ©) such that truncations of 
h belong to L..(C) (i.e., such that Ay.) € L.(C) for w € (0, ©), where 


*A simple way to circumvent the need for invertibility mentioned in (Ref. 1, 
Comments of Section 3.5) is often much less satisfactory for the purpose of obtaining 
explicit expressions for the Volterra kernels. 
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p / q 
(INPUT) (OUTPUT) N 
Fig. 1—Signal flow graph. Fig. 2—Feedback part of the flow graph of Fig. 1. 


hy.) (€) = A(t) for ¢ S w and h,,)(t) = 0 otherwise). Clearly unlike L..(C), 
H(C) can contain unbounded functions. 


2.2 The class of systems 


Consider a physical system with an input v drawn from L..(C) and 
an output w contained in H(C). Let the system be composed of linear 
elements as well as nonlinear elements. Suppose that the nonlinear 
elements can be viewed as collectively introducing a constraint that 
can be written as y = Nx, in which N is a map from one subset of H(C) 
into another, where x and y, respectively, are the H(C) input and 
output of the nonlinear part of the system. 

With regard to the remainder of the system, which is linear, assume 
that there are linear maps A, B, C, and D of H(C) into itself such that 


x= Aut Cy (2) 
w= Du + By. (3) 


A signal-flow-graph representation of the relations under consider- 
ation is given in Fig. 1.* Concerning the degree of generality of the 
model, and the assumption that the values of v, w, x, and y have the 
same dimension n, notice that we have not ruled out the possibility 
that some components of v, x, and/or y have no effect on the system, 
and, similarly, that certain of the components of w can be ignored. 
Nonzero initial conditions, if any, are assumed to be able to be taken 
into account either in N or as inputs to the system. 


2.3 General expansions 


Consider three hypotheses: 
A.1: The restrictions of A, B, C, and D to L..(C) are bounded linear 
maps of L..(C) into itself. 
A.2: There is an open neighborhood Sp of @ in L..(C) such that N maps 


* This is the same class of systems introduced in Ref. 1. Such representations of 
systems have been used in different but related settings in Refs. 5, 6, and 7. The maps 
A, B, C, and D exist for a very large class of systems. 
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So into L.(C) with N(@) = 6, and d”N (the mth order Fréchet 
derivative of N) exists on So for every m = 1, 2, ---. 
A.3: [I — C.dN(6)] is an invertible map of L..(C) onto L..(C), in which 
Tis the identity transformation on L..(C), and C., is the restriction of 
C to L..(C).* 

We shall prove the following general result. 
Theorem 1: When A.1, A.2, and A.3 are met, there is a positive 
number 6 and an open subset S of So with the following properties: 

(i) 6 ES, and for each v € L..(C) with ||v|| < 5 there exist unique 

x, y, and w of S, L..(C), and L..(C), respectively, such that (2), (3), and 
y = Nx hold. 

(tt) The function w described in (i) is given by 


w= Du+ B[ gm(Av) Je (4) 


for ||v|| < 6, in which the [gn(Av)]2 are defined recursively by the 
relations 


[ gi(Av)],: = [I — C.dN(6)]""Av (5) 
[ gi(Av) ]2 = dN(6)[ gi(Av)]i (6) 


and 


7 Gy Y 'N()L gn,(Av) h 


Rythyt++-+k=m 


k;>0 
‘[gn,(Av)hi +++ [gn(Av)]i (7) 
[gm(Av)]: = [I — CudN(0)]"Cohm (8) 
[ 8m(Av) Je = AN(@){ gm(Av))i + Am (9) 


for m = 2. In addition, the series on the right side of (4) converges 
uniformly with respect to ||v|| < 6. 


2.3.1 Proof of Theorem 1 


Notice that (2) and y = Nx can be written as x — CNx = Av and 
Nx — y = 6, for y and Nx belonging L..(C), and that an expansion for 
w in terms of uv can be obtained at once from (3) and an expansion for 
y in terms of v. These observations motivate us to proceed as follows. 


* Of course, dN(@) denotes the Fréchet derivative of N at the point 6. 
TIn (7), denotes a sum over all positive integers ki, --- , k; that add to 


Rkythgt-.-+ky=m 
kj>d 


t An alternative way not pursued here to prove a result along the lines of Theorem 1 
involves obtaining an expansion for x in terms of v, one for y in terms of x, and 
substituting the former into the latter. 
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Let Z denote the Banach space L..(C) X L..(C), whose elements we 
take to be two-component column vectors, normed by || u|| = max(|| ||, 
|| ||) for (wi, U2)’ € Lw(C) X L.(C), in which “’” denotes transpose. 
[We use the same symbol for the norms associated with # and L..(C). 
The meaning of the symbol will be clear from the context in which it 
is used.| Let S,.be any open ball in L..(C) of positive radius centered 
at 6. 

Define F: So X Si > & to be the map given by 


F (pi, P2) = pi — C.Npi 
F2( pi, p2) = Npi — p2 


for p © So X Si. 

The set So X S; is open in &. By A.2 it easily follows that the 
derivative dF(p):% — @ exists and is continuous at each point p of 
So X Si, and that it is given by 


_{U-C.dN(pi)] 0 


in which here “0” denotes the transformation of L..(C) into itself that 
replaces each element by 6. By A.3, it follows that dF(p) is an 
invertible map of # onto &, with inverse given by 


1_(  U-CedM(pyy? 
dF(p)' = (an! pI — C.dN(p1)}™ 4) 


for p © So X S\. Since dF( p) is invertible at p = (0, 8)’, by a standard 
inverse function theorem (Ref. 8, page 273; see also the comment in 
Ref. 1 concerning Lemma 1 of Ref. 1), there are open neighborhoods 
S. and S3 of (9, 8)’ in Z, with Sz C So X Si, such that for each g € Ss; 
there is in S2 a unique p such that Fp) = q. Using the bounded- 
ness of the restriction of A to L..(C), 6: > 0 can be chosen so that 
(Av, 6)’ € Ss for v € L..(C) with ||v|| < 51, and thus so that for each 
such v, there is in S2 a unique (x, y)’ with the property that F(x, y) = 
(Av, 6)’ [i.e., such that (2) and y = Nx are met]. 

Observe that the set S = {u:(u, Nu)’ € S2} is an open subset of So, 
and that for any 6 € (0, 6:) and for each v € L..(C) with ||v|| < 6, there 
is a unique (x, y, w) in S X L..(C) X L..(C) such that (2), (8), and y = 
Nx hold, as claimed in (2). 

With regard to part (iz), we shall use the following Lemma in which 
f denotes any map from an open subset X of the Banach space # into 
BZ with the property that there is a nonempty open convex subset U 
of Z such that for each u € U there is in X a unique x, such that 
/(xu) = u, and in which g stands for the map of U into X defined by 
flg(u)] =u foru€ U. 


(11) 
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Lemma 1: Assume that the Fréchet derivative d™f exists on X for 
each m. Let uo € U, and suppose that df[ g(uo)] is an invertible map 
of & onto itself. Then there is a o > 0 such that the expansion 


g(u) = g(uo) + x 2m(Uo, U — Uo) 


is valid and uniformly convergent for u € U with ||u — || < o, where 
81 (Uo, U — Uo) = df[_g(uo)] (wu — uo), 


and 


Bm(Uo, U — Uo) = —df[ g(uo)]’ Py ay : y , d'f[ g(uo)] 
eo 


* 8p, (Uo, U — Uo) Bx,(Uo, U — Uo) +++ Bx,(Uo, U — Uo), m= 2. 


Lemma 1 is a special case of Theorem 4 of Ref. 1 (see also Ref. 9). 

With S, and S3 as indicated above before Lemma 1, choose X = So, 
assume without loss of generality that S3 is convex, and take U = S3. 
Throughout the remainder of this section, let f denote the restriction 
of F to X. The following lemma is proved in Appendix A. 
Lemma 2: Under the conditions of Theorem 1, for each p € X and 
every 1 = 2, 3, --- the lth order Fréchet derivative d'f(p) exists, and 
we have 


—C..d'N(pi)huhas +++ day) (12) 


I — 
d f(p)Arhe hy == ( d'N(pi)huha Shs hn 


for any elements hi, hz, +++, hi of & (where hj; denotes the first 
component of h; for each j). 

By Lemmas 1 and 2, there is a o > 0 such that S; contains an open 
ball in # centered at (0, 0)’ of radius o, and the solution r € X of 
f(r) = s for s € & with ||s|| < o is given by the uniformly convergent 
series )im=1 2m(s), in which g;(s) = df[(6, 6)’]"'s, and 


&m(s) = —dF[(8, 8)’T* ¥ (2!) 
l=2 ky tkhot+-+++k =m 
k;>0 


-d*f[ (8, 6)’ \ gx, (s) ue En,(S) 


for m = 2. In particular, by (11) and Lemma 2, when s € B with 
||s|| < o and s has the form (s), 8)’, we have 


_() [-CedN(6)}'s; 
&i(s) = Cone LN (13) 


and 
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7 [I - C.dN(0)}? Oy as 
&n(s) Sal ae C..dN(6)} 7 y (L!) : 


[=2 
a; ea oe ay 14) 
Rytkhot-+-+k,=m d’N(8){ ge,(s) hi o [ gz,(s) hi 
kj>0 

for m = 2. 

Now choose 6 € (0, 61) so that || v|| < 5 implies that || Av|| < 0, and, 
referring to the g,, of (13) and (14), observe that for || v|| < 5, y of part 
(i) of the theorem is given by y = Yn-1 { 8m[ (Av, 6)’ ]}2. From (14), 


[ gm(s)h: = [I — C.dN(6)]* 


on (2) coe > Cod'N(8)[ e,(s)h +++ [8x,(s)h, 
=2 ss ae ai ee 


~ 


and 


[ gm(s)]2 = dN(6)[I — C.dN(6)]"* 


~y (Ly > Cod’N(9)[ ge,(s)]i -++ [ gx,(s)]i 
i=2 Rythyt-+-+ky=m 


+Y QW)" LY aNO)Lg,(s)h +++ [gx(s)h 
l=2 ie 


for m = 2 and || s:|| < o which, together with (13), completes the proof 
of the theorem. (The [g,,(Av)]; in Theorem 1 correspond to the 
{ 8n[(Av, 8)’]}; here.) 

2.3.2 Comments 


In principle, it is straightforward to give an explicit expression for 
any term in the series in (4). For example, it is a simple exercise to 
verify that the third-order term B[ g3(Av) |e is 


%B{dN(6)[I — C.dN(0)]~'C.. + I} d°®N(6) 
-{{I — C.dN(0)]""Av}*, (15) 


when d?N(8) is the zero operator (i.e., is the zero operator in the space 
to which d?N(6) belongs). If dN() also is the zero operator, then of 
course (15) is simply 


YsBd°N(8)(Av)?*. 


The interpretation of (15), and more general expressions, under 
certain assumptions concerning the forms of N, A, B, and C is ad- 
dressed in the following section. 
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Theorem 1 (and Lemma 1) hold if Z..(C) and H(C), respectively, are 
replaced with any complex Banach space and any linear space con- 
taining the elements of the complex Banach space. 


2.4 Volterra expansions 


In this section, we introduce, discuss, and prove our main result. We 
begin by considering the following definitions and two hypotheses, B.1 
and B.2. 

For each J = 1, 2, --- , let Ro(Z) denote the subset of R“*” given by 
Ro(L) = {(vo, vi, «++, V1) E R™:up = v; = 0 for i = 1, 2, «++, 1}. 

For any positive integers g and J, let S denote the set of complex 
n X q matrix-valued functions h defined on Ro(Z) such that each hj; is 
Lebesgue measurable and bounded on Ro(Z), and satisfies 


sup [ | Aij(t, 71, +++, T1)|d(71, +++, 71) < . (16) 
t=0 [0,¢}2 
B.1: There are elements a, b, c, and d of S{ such that for each p € 
H(C), 
t 
(Ap) (2) -| a(t, t)p(t)dr 
0 
t 
(Bp) (t) = | b(t, t)p(r)dt 
0 
t 
(Cp) (t) = | c(t, t)p(t)dr 
0 
t 
(Dp) {t) -| d(t, t)p(r)dr 
0 
for t= 0. 


In hypothesis B.2 below, Io denotes the set {z € C”:|z;| < y for i = 
1, 2, --+, m}, in which y is a positive constant and C” is the normed 
linear space of complex column n-vectors with zero element 9c and 
norm |-| given by |z| = max;| z;| for z € C”. 
B.2: N is defined on I’ = {s € L..(C): || s|] < y} by 


(Ns)(t) = y[s(é), t],  t20 


where 7 is a map from Ip X [0, ©) into C” with the following 
properties: 

(t) n(8c, t) = Oc for t= 0. 

(ti) The function é given by &(¢) = n[s(t), ¢], t = 0 is Lebesgue 
measurable on [0, ©) for each s ET. 
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(vit) For each ¢ € [0, ©), y(-, £) is a continuous map of Ip into C’, 
and for each ¢ € [0, ©), for 1 = 1,7 =n, and for any point a ET, the 
function 2; +> ni{ai, +++, Qj-1, 2;, Aj+1, °**, An, t) is differentiable with 
respect to the complex variable z; for | z;| < y. [This implies (see Ref. 
8, pages 204, 205, 226, 227, 230) the existence throughout I of every 
mth order partial derivative 

On: 


—_——_—_ (17) 
02;,,02),,_, 20° 02), 


for each ¢ and all m.] 

(tv) For any m, ji, +++, jm, and 1, the partial derivative (17), which 
we denote by p(21, «++, Zn, t), satisfies the conditions that the function 
t > p(0, ---, 0, ¢) is bounded on [0, ©), and that p is uniformly 
continuous on closed subsets of Io uniformly in ¢, in the sense that 
given a closed I'oo C Io and a 6; > 0 there is a 6; > 0 such that 


| p(Za1, et, Zany t) — p(Zn, se Zbny t) | < 51 


for t = 0 whenever z, and z, are elements of Ioo such that 
|Za — 25| < do. 
Following are comments and an example. 
If n(-, £) is independent of ¢ and (iii) is met, then (ii) and (iv) are 
met. 
The conditions on 7 of B.2 are met if, for example, 
p 


ni(z, t) = x Biy(t)Ay(zi), t2=0 
Ga 

for each i and z € To, in which p is a positive integer, the B;; are C’- 
valued bounded measurable functions, and each d,; is an analytic 
function from the disk | z;| < y in C’ into C’ such that A,;(0) = 0. In this 
important case, N restricted to I’ can of course be represented by n 
single-input single-output memoryless, possibly time-varying, nonlin- 
ear operators. 

In order to introduce another needed hypothesis, consider the fol- 
lowing proposition. 
Proposition 1: When c € S*? and n satisfies the conditions of B.2, for 
each p € H(C) there exists a unique gq € H(C) such that 


t 
p(t) = g(t) — | c(t, 7)L(r)g(t)d7r, t20, (18) 
0 
where L is the n X n matrix-valued function defined on [0, ~) by 
Lij(t) = dni(2i, +++, Zn, t)/dz; at 2) = 22 = +++ = 2, = 0 for each i, j, 
and t. 


Since L is measurable on [0, ) (see the proof of Lemma 3 in 
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Appendix B), Proposition 1 follows at once from Lemma 6 in Sec- 
tion 2.4.2. 

B.3: Under the hypotheses of Proposition 1, (18) has the further 
property that p € L..(C) implies that the solution q also belongs to 
L..(C). 

The “further property” of B.3 has the interpretation that the feed- 
back part of the graph of Fig. 1, shown in Fig. 2, is bounded-input 
bounded-output stable in the indicated sense when N is replaced with 
its linearization at the origin [by which we mean its linearization (see 
Lemma 3 below) at @ extended in the natural way to all of H(C)]. The 
node labeled “output” in Fig. 2 is an intermediate output node. For 
our purposes here, the output label can be moved to the node to the 
right of N when the matrix L(t)~’ exists for each ¢ = 0 and has 
uniformly bounded elements. 

We shall use also the following definition and proposition: 
Definition: Throughout the remainder of this section, for each l, 
x[v(71), +++, U(t:)] denotes the column vector of order n' whose 
elements are the n’ distinct products Vo, (T1) Va, (T2) +++ Vo,(t1), Corre- 
sponding to distinct sequences w, we, +++, w; with each w; drawn from 
{1, 2, -+-, n}, arranged in an arbitrary predetermined order. 
Proposition 2: If k; € S“) for some l, then the iterated integral 


t t 
| exer’ i kilt, Tl) *%"%y 71)x[v(71); irae v(T1) |d71 oe dt, 
0 0 


exists and is invariant with respect to interchanges in the order of 
integration for each t = 0 and uv € L.(C), and Vz,(v), defined on 
[0, «) by 


Vi, (v)(t) = | mee i Rit, t1, +++, T1)x[U(71), »+ +, v(t) ]dr1 +++ dr 
0 0 


for an arbitrary vu € L..(C), ts an element of L..(C). 
Proposition 2 is a special case of Lemma 4 of Section 2.4.2. 
The following is our main result. 

Theorem 2: Suppose that B.1, B.2, and B.3 are met. Then 

(t) The hypotheses of Theorem 1 are satisfied. 
(ii) For each 1 = 1, 2, --+ there is ak; € S“) such that 


w= > Viv) for |lv|| <4, (19) 
l=] 


with the series uniformly convergent with respect to ||v|| < 5, where v, 
w, and 6 are described in Theorem 1, and V;,(-) is as indicated in 
Proposition 2. 

(iii) Each k; can be taken to be continuous on R(l) when a and d 


210 THE BELL SYSTEM TECHNICAL JOURNAL, FEBRUARY 1982 


are continuous on R,(1), and b and c meet the condition imposed on 
s in part (ti) of Lemma 4 below [the condition is met if b and c are 
continuous on Ro(1)/. 


2.4.1 Comments 


Theorem 2 is proved in the next section. Using the proof given there, 
it can be shown that, as one would expect, the Volterra kernels k; can 
be taken to depend on only (¢ — 71), --- , (¢ — t:) when a, b, c, and d 
depend only on (¢ — Tr), and »(-, ¢) is independent of ¢.* 

Similarly, each k; can be taken to be real valued (i.e., to have zero 
imaginary part) if a, b, c, d, and the partial derivatives of n(-, ¢) at the 
origin are real valued. This shows that Theorem 2 establishes the 
existence of a Volterra-series expansion for the important correspond- 
ing case in which v, w, x, and y in Fig. 1 are restricted to be real valued 
and N (which then would be a map between real-valued function 
spaces) can be analytically extended so that the hypotheses of the 
theorem are met.' 

For the ame input case in which either n = 1 or u;(t) = 0 for all ¢ 


and i = 2, 3,--+, n, (19) takes the more familiar form 
w(t) = sf [ hilt, Ti, °°", 71) 
>» 
*U1(71)U1(T2) +++ Ui(t1)AdTidT2 +++ ati, t=0 


for sup:20|U:(t) | < 5, with the A, belonging to S{. 


By modifying the proof given in Section 2.4.2, results similar to 
Theorem 2 can be obtained for cases in which the basic underlying 
function space L..(C) is replaced with another complex Banach space, 
and/or A, B, C, and D have a more general’ (or different) form. Of 
some importance is the case in which L..(C) is replaced with the 
corresponding set L..(C)(T') of bounded functions defined on a finite 
interval [0, 7'], and a theorem along the lines of Theorem 2 for this 
case is given in Appendix F. 


2.4.2 Proof of Theorem 2 


Our proof uses five lemmas, which are proved in the appendix, and 
an inductive argument using Theorem 1. We begin with a description 
of the lemmas and some associated definitions. 


* See Proposition 7 and Lemma 2 of Ref. 1. 

In this connection, Theorem 5 of Ref. 1 can be used in place of Lemma 1 to prove 
results along the same lines as Theorems 1 and 2, but with L..(C) replaced with the 
corresponding function space over the real field, and Corollary 1 of Ref. 1 can be used 
to obtain corresponding pth order approximation results under weaker differentiability 
hypotheses. 

* Detailed results for cases in which a, b, c, and d are replaced with certain generalized 
functions, and N is not necessarily memoryless, will be given in another paper. 
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Lemma 3: Suppose that B.2 is met. Then N maps I into L..(C), each 
d”’ni[si(-), Spc Sn(-), -] 
02;,02;,, °°* OZ;, 
is bounded and measurable on [0, ») for each s ET, d”N(s) exists for 


each s ET and all m = 1, 2, --- , and, for any m, we have [d"N(s)hy 
eee h(t) ]i = 


: E a” nil silt), haan | Sn(t), t] 
200 ee ns Dhaest t= 0 
pe ey eee eg ee) 
for each s ET, each i, and any m elements hy, ho, «++, hn of L~(C). 


Definition: For each h € S‘,A denotes the function defined on 
[0, ~)"*) by A = A on Rol) and h = Ong (the zero n X q matrix) 
otherwise. 

Lemma 4: Suppose that h € S\ for some 1=1, that s ES, and that 
u is a bounded measurable function from [0, ~)' into the complex 


numbers. Then 
(t) The function k defined by 


t 
Rit, Ti °%"%y 71) = | s(t, t)A(r, T1, °°", T1)aT 
0 
for (t, 1, +++, 71) © Ro(l), belongs to S{. 
(ii) If h is continuous on Ro(l), and § meets the condition that 
each §;;, given by 
Oi(a, t) = | | Si(t +a,T)— Si(E, t)|dr 
0 
for t=0 and (¢ + a) = 0, satisfies 5;;(a, t) > 0 as a— 0 for each t, then 


k is continuous on Rol). 
(tit) The iterated integrals 


t t 
| al hit, T1, cee, TH)U(TI, ++, TAT «++ AT) 
0 0 
and 


t t 
| tee j R(t, 1, +++ T1)U(TI, +++, TIT +++ dt, 
0 0 


exist, and are invariant with respect to interchanges of orders of 
integration, for t = 0, and p defined by 


t t 
pit) = | ees i h(t, Tl, ° ey T1)U(n1, Sp ey T1)d71 eas dvi, t=0 
0 0 
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is an element of L..(C). 
(tv) We have 


t ; 2 
| s(t, T) | wae | A(t, T1, °*%%y 71)u(n1, ees T1)a7) eee dridt 
0 0 0 


t t 
= i see | Rit, 11, «++, T1)U(T1, ¢ ++, TAT © ++ AT, t= 0. 
0 0 


Comment 

The condition on s of part (ii) of Lemma 4 is met if s is continuous 
on Ro(1), or if s(t, r) depends only on the difference (¢ — 7) (see Reef. 
11, page 12). 
Definition: If r and s are two complex column n-vectors, then rs 
denotes the column n-vector defined by (rs); = r;s; for i = 1, 2, +--+, 7. 


Lemma 5: Ifh € Ss” andkeE se. then the function s, defined on 
Ro(p + q) by 


s(t, Ti, °°" 5 Tp+q) = hit, Ti, °"°%, tr) kit, Tp+1, °%"%, Tp+q) 


for (t, T1, ++, Tp+q) © Ro(p + q), belongs to S\?*®. 
Lemma 6: If \ € S®, then for each p € H(C) there is a unique q € 
H(C) such that 


t 


p(t) = g(t) — | A(t, r)q(t)dt, = t= 0. (20) 


0 


In Lemma 7, below, we refer to the following two hypotheses. 
C.1: ) € Si, and A denotes the map of L..(C) into itself defined by 


t 
(Ap)(t) = i A(t, 7) p(t) dr, t=0 
0 
for p € L..(C). 
C.2: \ € S®, and, for each p € L..(C), the unique element g of H(C) 
such that 


t 


p(t) = g(t) — | A(t, t)q(t)d7, t20 


0 


satisfies the condition that g € L..(C). 


Lemma 7: Suppose that C.1 and C.2 hold. Then (I — A) is an 
invertible map* of L..(C) onto itself, and there is ax © SY such that 


* Here, as in Section 2.3, J denotes the identity transformation on L..(C). 
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t 


(I — A) ’p(t) = p(t) - [ k(t, t)p(t)d7, t20 


0 


for every p € L..(C), and such that if \ meets the conditions imposed 
on s of part (ii) of Lemma 4, then so does k. 

This concludes our statement of the lemmas that we shall use. As 
mentioned at the beginning of this section, Lemmas 3 through 7 are 
proved in the appendix. 

It is clear that (under the hypotheses of Theorem 2) A.1 is met, 
Lemma 3 shows that A.2 is satisfied, and, by Lemmas 3 and 6, as well 
as the observation that C.1 together with C.2 imply that (J — A)’ 
exists, we see that A.3 also is satisfied. Therefore, the hypotheses of 
Theorem 1 are met. 

With v, w, and 6 as in part (ii) of Theorem 1, 


w = Du + : B[ gn(Av) Jo 


for ||v|| < 5, where the [ g;,(Av) ]2 are defined by (5) through (9), which 
involve associated functions [ gn(Av) ]:. 

For each positive integer p, let H, denote the hypothesis that we 
have 


[ 8m(Av)]:(é) -| pane | Qm{t, T1, +++ Tm) 
0 0 


-x[v(71), oes U(Tm) |d7, see dtm, 
and 
[ 8m(Av) Jot) = L(A)[ gm(Av) ]i(é) 


an t t 
+ »> (1!) > Bilt) | ae ats | Thy bl, T1, °*°, Tm) 
I kytkot+-+++ky=m 0 0 


=2 


-x[U(71), +++, U(tm) dt +++ dtm 


for t = 0, ||v|| < 6, and m = 1, 2, --+, p, in which 
(t) by the sum over J when m = 1 is meant the zero n-vector, 

(ii) each gm belongs to S{%., 

(uit) L is the n X n matrix-valued function described in Propo- 
sition 1, 

(iv) for 1 = 2, the B, are bounded* measurable n x n' matrix-valued 
functions over [0, 0), 

(v) for m = 2, the rz,,..., are n’ X n™ matrix-valued functions 


prt 


* By B, bounded is meant that its elements are bounded. 
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defined on Ro(m) such that each (rz,,...2,)ii € SY” with n = 1, and 

(vi) the qm, and the ry,,...,.2, for m = 2, are continuous on Ro(m) 
when a is continuous on Ro(1) and c meets the conditions on s of part 
(ti) of Lemma 4. 

By Lemma 3, L is bounded and measurable. Using (5) and (6) as 
well as Lemmas 3, 4, and 7, we see that Hj, is met. [Notice that s given 
by s(t, r) = u(t, r)v(rt) meets the condition of part (iz) of Lemma 4 
when u € S!", u meets the condition, and v is a bounded measurable 
n X n matrix-valued function on [0, ).] Thus, by Lemma 4, there is 
a k; € S® such that 


Do(t) + BL gi(Av) ]o(t) = | ki(t, t)v(7)dz, t=0 
0 


for ||v|| < 6, and A; is continuous under the conditions on a, b, c, and d 
of part (iii) of Theorem 2. 

By Lemma 4 (which holds for any n), it easily follows that if H, is 
met for some p = 2 then there is a k, € S43) such that 


B[ gp(Av) }o(¢) -| nie | Rp(t, 71, +++ 5 Tp) 
0 0 


-x[U(71), +++, U(tp) ]dr1 +++ dtp, t=0 


for ||v|| < 6, and such that k, meets the continuity requirement of part 
(tit) of the theorem. Therefore, to complete the proof of the theorem 
it suffices to show that H, is met for every p. For this purpose, suppose 
that H, is satisfied for some p. Using (7), we have 


(p+) 
Awsy = Y (LY) > d'N(6) 
t=2 Rythot- ++ +k, =( pt) 
k,>0 


*[ gx,(Av) hil ge,(Av))i +++ [ge(Av)]1, [lvl] < 6. 


Now let J be a fixed integer such that 2 <= /< (p + 1), and let hi, ---, 
k, be positive integers such that ki + ko + +--+ + ki = p + 1. Using 
Lemma 3, 


{d'N(6){ gx,(Av) il gx,(Av)]i «+> [ge,(Av)]i(d)}: 


ea as bilt, ji, +++, Ju) [[ fae Ti, °° *, TR)X 


Jy=1 


se 


J 


-[v(71), +++, v(t2,)]d71 -- ara] ; {[- [ Quilt T1, +++ Th,) 


-x[v(71), +++, U(t%,) dt +> an, 
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for ¢ = 0, ||v|| < 6, and each i, in which the 06;(-, 71, --+ , jz) are bounded 
and measurable. By Lemmas 4 and 5, we see that 


d'N(#)[ gz,(Av)]i «++ [ge(Av)]i(t) 
= B,(t) | eee eer T1, °**, T pt) 
0 0 


-x[v(7), cee, U(T(p+1)) |dr1 °° T(p+l)s t=0 


for ||v|| < 5 and for some B; and ry,,...,., of the type required. [Here we 
have used the observations that a product of integrals 


| aes i gn, (t, T1, °°°*%, Tr, i,t, {xlLv(71), ones v(TE,) }} 2, 
0 0 


t t 
-dt eee dtr, eee i eee : gz, (t, T1, °° *, Th inl 
0 0 


-{x[v(71), +++, v(tx,) J}y,dt1 «++ ate, 


in which J; is drawn from {1, 2, ---, 7} for each j, can be written as 
the iterated integral 


t t 
| eee | gp, (t, TI, eee ’ Thirty eee art, T (kit. ++ +k +05 eee ; 
0 0 


7 p+ )jt{XLU(T1), +++, U(TH,) Dy 
. {x[U(TG,+---+2),41)) einer V(T(p+1)) ]},€71 sere ar ( p+1), 


and that r, given by r(t, 71, -++, Tp+1) = ga, (t, T1, +++, T:,)° 
ily QR AE, Tey+---thyy+Ds °° * » Te+1) 7,1, ON Ro(p + 1), is continuous when 
each q;, is continuous on Ro(X;).] 

Finally, using (8) and (9), and Lemmas 4 and 7, we observe that 
H( p+1 is satisfied, showing that H, is met for all p. This completes the 
proof.* 


APPENDIX A 
Proof of Lemma 2 
Assume that p € X is given. 


Let Q denote the linear map from & into the space L(&, &) of 
bounded linear operators from # into &%, given by 


* Our proof shows also that the theorem holds if B.1 and B.2 are modified to the 
extent that an arbitrary constant (scalar or nm X n-matrix) multiple of the identity map 
in H(C) is added to B, and 7(-, ¢) is required to be independent of t. 
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_ —C..d?N (piri 0 
a @N(piyri: 0 a 
for any r € &. Since sup{|| Qhihe||: 41, he € B with || hi || = || Ae] = 1} is 
finite, it follows that @ is bounded. 
Let h € & be such that (p + h) € X. Observe that, using (10) and 
(21), 
|\df(p + h) — df(p) — Qhl| = sup{||df(p + A) 

— df(p)h — Qhhy|:-’1 € B, |lAr|| = 13 = o(/ Al), 
which shows that d’f(p) exists, that d?f(p) = Q, and hence that the 
expression for d’f(p)hyhz given in the lemma is valid. 

Now suppose that for some / = 2, d'f(p) exists and that it satisfies 


(12). Met M denote the continuous multilinear mapping of Z“*” into 
B given by 


- —C..d YN (pi)qugar +++ Que 
M(qi, G2, °°° ; = 
(1; 42 qu+y) dN (piqugai °**Q(i4+1)1 


for qi, G2, -**, Qu+1) belonging to Z. We shall use M to denote the 
usual associate (Ref. 10, page 318) of M that belongs to L(Z%, 
L(Z, -++ , L(Z, B)-+-)) with (J + 1) L’s, in which L(Aj, Az) stands for 
the set of continuous linear operators from the Banach space A, into 
the Banach space A2.* 

Using the fact that 


\|d'f(p + h) — d'f(p) — Mh|| = sup{||d'f(p + A) 
-Ayhe «++ hi — d'f(p)hAihe +++ hy 
— Mhhyhz «++ Ayl:|| Aa || = ||Ael] = --- = el =D 


for (p + h) € X, as well as the boundedness of C.., we find that || 
d'f(p + h) — d'f(p) — Mh|| = o(||A])) as ||A|| > 0, which shows that 
d‘* f(p) exists and equals M. This proves the lemma. 


APPENDIX B 
Proof of Lemma 3 

For each ¢, (viz) implies (Ref. 8, pages 204, 205, 226, 227, 230) the 
existence throughout Ip of the F’-derivatives of all orders of the map 
n(-, t): To C C” — C”. In particular, each partial derivative (17) exists 
in Ty for any t= 0." 


* For example, if J = 2, te L(Z, +--+, L(Z, B)-+-)) = L(, L(S, L(Z, B))). 
t See Section 8.9 of Ref. 
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Given anys ET, 
1 
n[s(é), t] = | dy[ Bs(t), t]dB-s(t), t20 
0 
in which dy Bs(t), ¢] is the F-derivative of y(-, ¢) at the point Bs(t) 
(i.e., dy[ Bs(t), t] is the n X n matrix whose yth element is dn;(z, t)/dz; 
evaluated at z = @s(t)). By (iv), the elements of dy[ s(t), £] are 
bounded on (8, t) € [0,1] < [0, ©). Thus, using (zz), N maps T into 
L..(C). 
Similarly, for any s € T and any A € L..(C) such that (s + h) ET, 


nls(t) + A(t), t] — nfs(t), €] — dyfs(t), A) = [ {dn[B(s(2) 
0 


+ h(t)) + (1 — B)s(t), t] — dyls(é), t}}dB-A(t), t20. 
This, together with the continuity described in (iv), yields 
sup |n[s(¢) + A(t), €] — nfs(e), f] — dnfs(e), EV()| = ol) (22) 


as ||h|| > 0. Since the pointwise limit function of a sequence of 
(Lebesgue) measurable functions is measurable, and, for each i = 1, 
2, +--+, n, (22) holds with A(t) = ou(t) for t = 0, in which o is a scalar 
and u(z) is the element of C” with u(z); = 1 and u(i); = 0 for i FJ, it 
easily follows that the elements of dy[s(-),-] are measurable on 
[0, 0). By (iv) these elements are bounded. Thus, using (22), dN(s) 
exists and 


[AN (syh(e)j = yy THESE, ++ SalO), € 


Ji=1 0Z}, 


A(t), t20 


for each i. This shows that the m = 1 part of the lemma is true. 
Now assume that the assertions of the lemma are true for 1=m = 
l, and again let s € T be given, and let h € L..(C) satisfy (s + h) ET. 
By (iv), each 


a nT si(-), Ee Sn(-), -| 
OZ 41) 20+ OZ, 


(23) 


is bounded on [0, ~). To see that each is measurable, observe that for 
hy, he, --- , hi belonging to L.(C), 


: ” dnilsi(t) + hilt), «++ , S(t) + An(t), €] 
sup max see a i Se A RS Saabs ae RF | 
ded i as P 02;, eee 0zZ;, 14, ) 
i ” dnilsilt), «++, Sn (t), ¢] 
ON a ee eenaa 
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~ 


n 2 a n{si(t), «++, Snlt), C] 


one dy = tds hi; (t 
ui 2 Foe OZjy41 °° OZ), 
L 
ro + Ay (thi. (t)| = o(|] Al) II ale (24) 
ie 


which is a consequence of (iv) and the relation 
é! iLsi(t) =e hi(t), de Sn(t) + h,(¢), t] - dnilsi(t), ee gy Sn(t), t] 


0Z;, +++ O2;, 0Z;,°*+ O2Z;, 
x an [si(t), eyed Sn(t), t] 
= 2 Se ig) 


Ju+y=1 254,192); camel dz), 


0 


92 5.4419Z), se + 02), 


Aju, (t) 


Jasry=1 


n (1+1),. ores 
a; Oo OT, (0 La, t= 0. 


Jusy=l 82 )41,,:92), ges dz), 


It easily follows from (24) that each function (23) is the pointwise limit 

of a sequence of measurable functions, and is therefore measurable. 
In particular, Q:(s) defined by 

[Q:(s)(pi, +++ , Pern) (t)]i 

if y i Z an si(t), ees 59 Sn(t), t| 


Ju+y=) 4=1 j=l 92141) 9Zi) ves 02), 


*Prj,(E) p2j,(t) +++ Durnin t=0 


for Pi, Pz, +++, Dus in L.(C) and i = 1, 2, ---, n, is a continuous 
multilinear mapping of L..(C)“*” into L.(C). 

Proceeding as in the proof of Lemma 2, let Q;(s) denote the usual 
associate of Q:(s) that belongs to L(L.(C), L(L.(C), ---, L(L.(C), 
L.(C)) -++-)) with (2 + 1) L’s, in which L(A,, Az) stands for the set of 
continuous linear operators from the Banach space A; into the Banach 
space Ao. Using || d'N(s + h) — d'N(s) — Q1(s)A|| = sup{||d'N(s + 
h)hi «++ hy — d'N(s)hky +++ Ai — Qi(s)Ahy +++ Ail: ||Aal] = | Aell = 
-++ = ||/Az|| = 1}, as well as our induction hypothesis and (24), we see 
that ||d'N(s + h) — d'N(s) — Q.(s)h|| = o(||A|) as || || > 0. Therefore 
dt N(s) exists and is equal to Q;(s). This completes the proof. 


APPENDIX C 
Proof of Lemmas 4 and 5 


It suffices to prove the lemmas for n = 1 and u(1, --- , 71) = 1 for 
(m1, «++, T2) € [0, ©)’, and attention is now restricted to that case. 
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For ¢ > 0, one has 


Jo 


t 
| s(t, T)A(t, 11, +++ , TI) dt d(m1, +++, at) 
0 








t 
<| | s(¢, t)|-| A(z, Ti, -++, 71)|drd(n1, 29+, 77) 
[0,27 Jo 
t 
<| | | A(z, Tl, oe , 71) |d(n1, Tene , 71) | s(t, T) |dr 
0 Yj[O, ¢]! 


t 
< sup { [A(r, 71, +++, T1)|d(m1, +++, 72)-Sup | | s(é, 7) |dr, 
[0,r}! t=0 0 


720 


in which the measurability of 
t 
| s(t, T)A(r, 11, «++, T1)dr (25) 
0 


in (rt, +++, 72), and the justification for the interchange of the order of 
integration, follow from theorems of Fubini and Tonelli (Ref. 12, pages 
137-145). The measurability of (25) in (¢, 71, «++, 7) is also a conse- 
quence of these theorems.* Thus, since it is clear that k is bounded, 
(t) holds. 

Now let h and s satisfy the conditions of part (iz). Let (é, 71, «++ ,77) 
€ Rol) be given, let a, a, -++, a: be real variables such that 
(¢ + a, 71+ a, +++, 71+ a1) € Ro(l), and notice that 


k(t + a, 71 + a1, +++, 71+ a1) — R(t, 71, «++, 71) 


— i [s(t + a, tT) — S(t, 1) ]A(r, Ti+ Q1, °°*,T1+ arat 
0 


+ | S(t, t)[A(r, Ti + a1, °°*+,T:+ a1) — A(r, T1,°*+, 71) |dr. (26) 
0 


Using the hypothesis of part (iz) concerning s, the boundedness of h 
and s, and the uniform continuity of h on compact subsets of Ro(Z), we 
see that each integral in (26) approaches zero as (f + a, 7: + ai, -*°, 
1 + a1) > (t, 71, -++, T1), Showing that (iz) is true. 

Straightforward modifications of the proof of part (iii) of Lemma 3 
in Ref. 1 establish that (zit) here holds. 

With regard to part (iv), using the theorems of Fubini and Tonelli 
cited above, and the proposition that a bounded measurable function 


* Consider, for arbitrary finite T > 0, the.existence and iterated-integral representa- 
tions of the multiple integral fro, ry» S(t, T)A(7, T1, «++, TH (E, T1, «++, TI, 7). 
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on a set £ of finite measure is summable over E, we have 


t T T 
i s(t, T) i matic, | h(r, Pg se ey t1)d7 er eie dt,dr 
0 0 0 


t 
= | s(t, T) h(r, Ti, **°", t1)d(7, wee ys t1)dt 
0 (0,7} 
t 
= { s(t, T) Ar, Til, *%", 71)d(71, ae | t1)dt 
[0,¢]’ 


0 
t 
= i s(t, t)A(r, Tl, ee" y t1)dtd(n, vey T1) 
[0,2 Jo 


t t 
-| al R(t, m1, ..., 71)At1 «++ dtr 
0 0 


for t = 0, which establishes (tv) and completes the proof of Lemma 4. 
Under the hypothesis of Lemma 5, 


if | A(t, Tih", tp) Rt, T(p+l)) **%*% 5 T(p+q)) |d(7, met, T(p+q)) 
[0,¢}(P+9) 


<= sup | | A(t, m1, +++, Tp) |d(T1, +++, Tp) 
[0,2]? 


x sup | R(t, T(p+1)) *°* » T(ptq) |d(t(p+1), 9S oy T (p+q)) 
t=0 [0,c]7 


for every t = 0, which proves the lemma. 


APPENDIX D 
Proof of Lemma 6 


By the proof of Theorems 2.3 and 2.5 of Ref. 13, there exists a 
measurable function « from Ro(1) into the set of complex n X n 
matrices such that the elements of « are bounded on bounded subsets 
of fo(1), and « satisfies the resolvent equations 


k(t, tT) + A(é, 7) = i A(t, u)k(u, r)du (27) 


k(t, T) + A(t, T) = | k(t, u)A(u, 7)du (28) 


fort=7r20. 
For each p € H(C), the function g defined on [0, ) by 
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t 
q(t) = p(t) - i k(t, T)p(t)dt, = t20 (29) 
0 
belongs to H(C), and, using (27) as well as theorems of Fubini and 
Tonelli (Ref. 12, pages 137-145) to justify an interchange of order of 
integration, it is simple matter to show that g given by (29) satisfies 
(20) for each p € H(C). Similarly, it is essentially well known that (28) 
can be used to show that if there is a g € H(C) that satisfies (20) for 
a given p © H(C), then q satisfies (29), which completes the proof. 


APPENDIX E 
Proof of Lemma 7 

By Lemma 6 and its proof, (J — A) is an invertible map of L..(C) 
onto L..(C), and there is a measurable matrix-valued x, defined on 
Ro(1) such that the elements of « are bounded on bounded subsets of 
R,(1), with the property that (28) is satisfied and 


t 


(I — A)“ p(t) = p(t) - | k(t, 7)p(t)dr, t20 


0 


for each p € L..(C). Since (I — A)~’ maps L..(C) into itself, it follows 
(Refs. 14 and 15) that each x;; satisfies 


t 
sup i | «is(E, T) |dr < 0, (30) 
t=0 0 


Using (28), (30), and the boundedness of A, we see that k is bounded on 
Ro(1). Therefore, « € SY. 

Assume now that 4X satisfies the condition on s of part (17) of Lemma 
4, recall that « satisfies (27), and let r be defined by 


r(t, 7) = | A(t, u)K(u, r)du 


fort=7r=0." 
Let ¢ = 0 be given. For arbitrary i and /, let 


A;;(a, t) = [ |7i(¢ +a,T)— PAG T) |dr 
0 


for (¢ + a) = 0 (see the definition preceding Lemma 4 for the meaning 
of 7; r belongs to S2 because « and A do and (27) is met). Notice that 
to complete the proof of our lemma, it suffices to show that 
Ai;(a, t)— O0asa— 0. 


* With regard to the meaning of k, see the definition immediately preceding 
Lemma 4, 
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It is clear that A;;(a, 0) — 0 as a > 0. Assume now that ¢t > 0, and 
let a be such that t — |a| > 0. We have 


t—|a| 
Ai;(a, t) = | |rij(t + a, tT) — rij(t, 7) |dr 
0 


t+|al 
+ | | Fist +a,T)— Fi(t, tT) \dr, (31) 
t 


lal 


in which (by the boundedness of r) the second integral goes to zero as 
a — 0. Further, 


t-|a| 
| |rij(t + a, tT) — rij(t, 7) |dr 
0 


i 
0 


n t 
= | Arlt, u) Kpj(u,t)du | dr 
k= Jo 


n 


t+a 
By | Vx(t + a, U)K,;(U, T)du 
0 


k=1 





n 


t-|a| ro 
= oy | [Ain(t + a, u) — Aix(t, u) |-|aj(u, T) |dudr 
0 


k=1 Jo 


which, using the boundedness of the x;;, shows that the first integral 
on the right side of (31) also approaches zero as a — 0. 


APPENDIX F 
Volterra Expansions on a Finite Time Interval 


In this appendix, T denotes an arbitrary positive constant, L.. (C) 
(T) stands for the complex Banach space of measurable complex 
column n-vector-valued functions v defined on [0, JT] such that the jth 
component u; of uv satisfies sup;yefo,r} | u;(t) | < © for 7 = 1, 2, ---,n, and 
where the norm ||- |r on L..(C)(T) is given by |ju||r = max; sup; 
|v,(¢)|, and for each J = 1, 2, --- , Ro(l)(T) denotes the subset of R“*” 
given by Ro(l)(T) = {(vo, v1, «++, vr) EC R“”:T = vo = uv; = O for 
i=1,2,---, Z}. 

Similarly, for any positive integers q and J, S‘(T) denotes the set of 
complex n X q matrix-valued functions h defined on Ro(l)(T) such 
that each h;; is Lebesgue measurable and bounded on Ro(/)(T). 

We shall refer to the following two hypotheses. 

D.1: There are elements a, b, c, and d of S?(T) such that for each 
pEL.(C)(T), 
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t 


(Ap)(t) = i a(t, 7) p(t)dr 
0 

(Bp) (t) -| b(t, 7) p(t) dr 
0 

(Cp) (t) = | c(t, tT) p(t)dr 
0 


(Dp) (t) = (i a(t, t) p(t)dr 


0 


for ¢ € [0, T’]. 
D.2: With y, To, C”, and @ as indicated in the paragraph preceding B.2 
of Section 2.4, N is defined on T = {s € L..(C)(T):||s||7 < y} by 


(Ns) (t) = nfs(¢), ¢],  tE[0, T], 


where 7 is a map from Io X [0, T] into C” with the following 
properties: 

(t) (Oc, t) = @ for t € [0, T]. 

(ti) The function € given by é(¢) = 7[s(t), t], 0 sts T, is Lebesgue 
measurable on [0, 7T'] for each s ET. 

(iii) For each ¢ € [0, T'], n(-, £) is a continuous map of Tp into C’, 
and for each ¢ € [0, 7'], for 1 Si, 7 Sn, and for any point a € Tp, the 
function 2; +> ni(a1, +++ , Qj-1, 2j, Aj41, *** , Qn, t) is differentiable with 
respect to the complex variable z; for |z;| < y. [This implies (Ref. 8, 
pages 204, 205, 226, 227, 230) the existence throughout Ip of every mth 
order partial derivative 


Oni 


——_— 4 (32) 
92 j,0Zjn_y *** OZ 


for each ¢ and all m.| 

(iv) For any m, ji, «++ , Jm, and 1, the partial derivative (32), which 
we denote by p(21, +++ , 2n, t), satisfies the conditions that the function 
t > p(0, -+-, 0, £) is bounded on [0, 7], and that p is uniformly 
continuous on closed subsets of Io uniformly in ¢, in the sense that 
given a closed Ioo C Io and a 6; > 0 there is a 6: > 0 such that 


| p(Za1, °°, Zan, t) — P(2a1, s2* y Zbny t)| = 6b: 


for ¢ € [0, T] whenever z, and z, are elements of Ioo such that | 2. — 
20 | < do. 

Direct modifications of the proof in Section 2.4.2 suffice to establish 
the following result, in which by Proposition 2’ we mean the corollary 
of Proposition 2 obtained from Proposition 2 by replacing S?, t=0, 
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L..(C), and [0, ©) by S(T), ¢ € [0, T], L.(C)(T), and [0, T], 
respectively. * 

Theorem 3: When D.1 and D.2 are met, there is a positive number 6 
and an open subset S of I’ of D.2 with the following properties. 

(t) S contains the origin in L..(C)(T), and for each v € L..(C)(T) 
with ||v||7 < 6, there exist unique x, y, and w of S, L..(C)(T), and 
L..(C)(T), respectively, such that (2), (3), and y = Nx hold. 

(ii) For each J = 1, 2, --- there is a ky €Sy’(T) such that 


w= )) Vidv) for |lvilr< 64, 
I=1 


with the series uniformly convergent with respect to ||uv||7 < 65, where 
V.,(-) is as indicated in Proposition 2’ (which is described just before 
Theorem 3). 

(tit) Each k; can be taken to be continuous on Ro(/)(T) when a, 3, 
c, and d are continuous on Ro(1)(T). 
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* In this connection, see the last paragraph of Section 2.3.2. Also, a hypothesis 
corresponding to B.3 is not needed because when D.1 and D.2 hold, and L(t) is as 
indicated in Proposition 1 for ¢ € [0, 7], for each p € L.(C)(T) there exists a unique 
q € L.(C)(T) such that p(t) = g(t) — fo c(t, r)L(r)q(r) dr, 0S tS T. On the other hand, 
6 and S of Theorem 3 depend on T. 
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An Approximate Thermal Model For Outdoor 
Electronics Cabinets 


By J. C. COYNE 


(Manuscript received November 18, 1980) 


Electronic systems are installed in outdoor loop plant in metal 
cabinets which are essentially unventilated. The current trend toward 
greater circult miniaturization and higher power density increases 
the difficulty in the thermal design and points up the need for 
comprehensive thermal design guidelines. As a step toward that end, 
an approximate lumped thermal-conductance model is presented for 
calculating steady-state temperatures at three points of the cabinet 
(the hot sunny wall, the cool shaded wall, and the top row of circuit 
boards) as functions of input parameters of cabinet geometry, wind 
speed, solar radiation, and internal heat dissipation (assumed to be 
uniformly distributed). Calculated results are found to be in agree- 
ment with tests within about 10 percent. 


I. INTRODUCTION 


Electronic systems are typically installed in outdoor loop plant’ in 
metal cabinets which are essentially unventilated. The cabinets are 
subject to ambient temperature excursions from —40 to 120°F and 
subject to solar heating which can raise the cabinet interior tempera- 
tures 30°F above ambient. With a thermal design limit of 185°F at 
circuit boards, the allowable temperature rise because of circuit dissi- 
pation is, thus, limited to about 35°F. The current trend toward greater 
circuit miniaturization and higher power density further aggravates an 
already difficult thermal design problem and points up the need for a 
comprehensive thermal analysis of the outdoor electronic cabinet. This 
paper is a step in that direction. 

The circuit boards are typically arranged in outdoor cabinets (see 
Fig. 1) in much the same way as in conventional central office equip- 
ment bays and, to this extent, the heat transfer mechanisms are 
similar. In both cases, heat is removed from the circuit boards by 
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Fig. 1—Convective heat flow in an unventilated electronic cabinet, isolated on one 
wall. 


natural convection; air is warmed by the circuitry and it expands and 
rises between the columns of boards. But unlike the central office bay 
situation, the hot air is confined inside the cabinet and must recirculate 
to transfer its heat to the cabinet walls. In addition, solar heat is 
absorbed on the sunny exterior cabinet surfaces, some of which is 
transferred to the cabinet interior and convected to the opposing 
shaded walls along with the internally dissipated heat. At the exterior 
surfaces of the cabinet, both solar and dissipated heat are convected 
and radiated to the ambient. 

Whereas, the central office bay problem has been analyzed exten- 
sively, very little has been done to date on this more difficult cabinet 
problem. However, a problem closely related to it, which has received 
a great deal of study by Elder,” Eckert, and Carlson’ and others, is 
that of a fluid-filled two-dimensional enclosure whose opposing walls 
are held at a uniform temperature difference as shown in Fig. 2. This 
enclosure problem is approximately the same as that of an empty (air- 
filled) unpowered cabinet in the sun. The main difference is that the 
walls of the electronic cabinet are not isothermal but, instead, increase 
in temperature vertically because of natural convection. 

The core region of the cabinet of Fig. 1 and the enclosure of Fig. 2 
both exhibit a large vertical temperature gradient. According to Elder,” 
the centerline temperature (midway between walls of the enclosure) 
approaches the hot wall temperature at the top and approaches the 
cold wall temperature at the bottom, with a gradient at the midway 
height given by half the difference in wall temperatures divided by the 
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Fig. 2—Two-dimensional enclosure with isothermal side walls, heated at one side, 
showing flow path and temperature profiles: a is horizontal, b is vertical. 


enclosure height. Laterally, the temperature is nearly constant except 
near the walls. The vertical gradient violates the isothermal assump- 
tion implicit in text-book, heat-transfer relationships for natural con- 
vection on vertical walls. As shown later, the error is appreciable. 

The enclosure problem shown in Fig. 2 also approximates the 
convective heat transfer between heat-dissipating circuit boards and 
the walls of the powered electronic cabinet. The boards can be thought 
of as the “hot wall” of the enclosure analyzed in the literature. The 
main assumption here is that the boundary layer flow down the shaded 
cabinet wall and the heat transfer at this wall is the same regardless of 
whether the heat originates at an opposing hot wall or at the surface 
of circuit boards. Using this assumption, heat transfer coefficients from 
the literature will be incorporated into a simple lumped thermal 
conductance model from which approximate temperatures can be 
calculated for a powered electronic cabinet in the sun. 

Several empirical relationships for the heat transfer of an enclosure 
depicted in Fig. 2 have been proposed in the recent literature. For 
conditions applicable to loop electronic cabinets (Rayleigh number 
based on. height about 10°, height-to-width ratio about 3, and Prandtl 
number equal to 0.7) they all are in essential agreement. For instance, 
Seki et al.* in a recent paper propose 
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H -0.11 
Nug = 0.36 Pr®*! (7) Raf’, (1) 


where Nuy = Nusselt number based on enclosure height, Pr = Prandtl 
number, Ray = Rayleigh number based on enclosure height, H = 
enclosure height and W = enclosure width. After substitution of air 
properties at 120°F into the dimensionless numbers, Seki’s relationship 
becomes 


Q = 0.05AwW°UH-8( Ty — Te)?, (2) 


where Q is heat transfer (watts), T7 and Tc are the hot and cold wall 
temperatures (°F), and Aw is the area (ft?) of a wall. 

Catton® recommends the Berkovsky-Polevikov® relationship which 
at 120°F air temperature becomes 


Q = 0.048AwW°"H °(TH — Tc)'”®. (3) 


For typical values of H = 3 ft, W = 1 ft, and Ty — Tc = 20°F, eggs. (2) 
and (8) agree within 10 percent. 

By comparison, the recommended’ heat transfer relationship for a 
vertical isothermal plate (assumed to exist in an infinite constant- 
temperature air space) predicts 23 percent less heat transfer for the 
same conditions. Thus, the core region in an enclosure, in particular 
the existence of a vertical temperature gradient in the core, has a 
significant effect on the heat transfer at the enclosure wall. 

For the powered electronic cabinet problem modeled in the next 
section, eq. (2) will be used as an approximation for the convective 
heat transfer both between the opposing cabinet walls and between 
the circuit boards and cabinet walls. 


ll. A LUMPED THERMAL CONDUCTANCE MODEL 


A simple electrical analog for the calculation of approximate steady- 
state temperatures in a powered electronic cabinet subject to sun and 
wind is shown on Fig. 3. The absorbed solar radiation, assumed 
incident on half the cabinet surface, is represented by the heat source 
@, (watts) and the internal heat dissipation by Qp (watts). Heat is 
transferred to ambient from the exterior cabinet surfaces by combined 
long-wave radiation and wind-dependent convection. For radiation, 
the linearized Stefan-Boltzman equation is used 

. Qrad = AceAwTi(Tw = Ta), (4) 
where o = Stefan-Boltzman constant (5 x 107”? watt/ft? — °R‘), e = 
emissivity (0 < e < 1), Aw = surface area of one wall (ft”), Tw = wall 
temperature (°R), and 74 = ambient temperature (°R). 

For the wind-induced heat transfer at the cabinet’s exterior surfaces, 
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Fig. 3—Lumped conductance model for a powered cabinet in the sun. 


consider first the cabinet top. Assume horizontal laminar air flow. The 
heat transfer (Nusselt number) expressed in terms of Re (Reynolds 
number) and Pr (Prandtl number) for this situation is given by® 


Nu = 0.664 Pr°Re®. (5) 
Substitution of air properties at 100°F gives 
Awina = 0.21(u/D)°”, (6) 


where D is the distance across the top measured parallel to the wind 
direction and u is wind speed (ft/s). For random wind direction, D is 
approximately given by the average side dimension of the cabinet. 

The wind-induced heat transfer at the cabinet’s vertical walls at any 
instant of time depends on the incident wind direction and speed at 
that time. However, as will be shown, if random wind direction is 
assumed, then the average heat transfer coefficient (A) at each wall 
can also be approximated by the simple relationship given in eq. (6). 
The object here is to preserve the model’s simplicity. 

Consider the cabinet to be a long square cylinder having side D 
whose axis is normal to the wind. As before, assume D to be the 
average side dimension of the cabinet’s rectangular cross section. For 
this situation, the convective heat transfer is given by” 


Nu = C Pr°*’Re”, (7) 
where C and m are constants which depend on wind direction. For a 


range of Re from 5 X 10° to 10°, and for the wind normal to a cabinet 
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wall, C = 0.1 and m = 0.675. Over the same range of Re, and for the 
wind direction along a diagonal (45 degrees to a wall), C = 0.25 and 
m = 0.588. Assuming random wind direction, the average heat transfer 
coefficient (h) is approximately given by the average of these two 
cases. For a typical case D = 2 ft, u = 5 ft/s and for air properties at 
100°F, A is evaluated to be 0.337. The same substitutions into eq. (6) 
gives h = 0.335. So, although each wall of the square cylindrical cabinet 
has a different instantaneous heat-transfer coefficient depending on 
wind direction, the overall cabinet coefficient is approximately the 
same as that of a hypothetical cabinet which has wind parallel to all 
its walls simultaneously. Consequently, with the assumption of random 
wind direction, the average heat transfer at each cabinet wall is 
approximately given by 


Qwina = 0.21A w(u/D)**(Tw = Ta). (8) 


For the internal convective heat transfer from the hot (sunny) 
cabinet wall to the cool (shaded) cabinet wall, and also from the circuit 
boards to the cabinet walls, Seki’s relationship given in eq. (2) is used. 
In Fig. 3, this convective coupling is represented by a nonlinear 
conductance between walls, center-tapped to the heat source Qp. In 
adopting Seki’s results to the cabinet problem, several differences 
between the ideal enclosure and the real electronic cabinet are being 
ignored. The effect of these differences is discussed in Appendix B. 

In the no-solar case (Q; = 0 in Fig. 3), Qp divides equally, half being 
transferred to each wall. Thus, the temperature at the center tap 
represents the hottest internal air temperature, which occurs at the 
top row of electronics if the heat sources are uniformly distributed. Its 
magnitude relative to the wall temperature is given in the model by 
half the dissipated power flowing through half the coupling conduct- 


ance. Thus, 
Reoupt Qp a 
T; i Tw = 9 (%) . (9) 





An isothermal board (B) shown in Fig. 3, having an area (two sides) 
of S ft”, located in the top row of electronics and dissipating gq watts 
will experience an additional temperature rise above the internal 
cabinet ambient (center tap) given by 


T's — Tr = q/(Sh), (10) 


where h is the heat transfer coefficient at the board and S is the board 
surface area (both sides). 

With no internal dissipation (Qp = 0 in Fig. 3), solar heat in the 
model flows from the hot sunny wall through the coupling conductance 
to the cool shaded wall. In this case, the temperature at the center-tap 
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Fig. 4—40E Feeder distribution interface cabinet at test site. 


represents the internal temperature at an elevation corresponding to 
the mean between the hot and cold wall temperatures, which occurs 
near the midway height of the enclosure in Fig. 2, but somewhat higher 
in the cabinet of Fig. 1 because of the cabinet wall’s vertical tempera- 
ture gradient. 

Thus, in both limiting cases of Qp = 0 or Qs = 0, the proposed 
thermal conductance model accounts for the flow of heat in roughly 
the correct way. It remains to be seen how well it agrees with experi- 
mental data. For this purpose, the model predictions will be compared 
with test data taken by the author during the summer of 1979. In these 
tests, 96 circuit boards, each with eight resistors, were mounted in 
three rows of BELLPAC* housings inside a Western Electric 40E FDI 
cabinet. Figure 4 shows photos of the front and back views of the 
cabinet at the test site. With all boards powered, the heat dissipation 
was fairly uniformly distributed in the cabinet. 

In evaluating the thermal conductances of the model for this test, 
assume that the 35 ft? surface area of the FDI cabinet is equally 
divided into 17.5 ft? of hot sunny surface and 17.5 ft? of cool shaded 
surface, each at a uniform temperature. In eqs. (2), (4), and (8) 
substitute W = 1 ft, H = 3.3 ft, D = 2 ft, u = 10 ft/s, T4 = 540°R, Aw 
= 17.5 ft”, and e = 1 to obtain Qrap = 5.51(Tw — Ta), Qwina = 7.86(Tw 


* Trademark of Western Electric Company. 
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— Ta), and Qceoupt = 0.58(TH — Tc)’. The inverse relationships, shown’ 
on Fig. 5, are 


Tw = Ta = 0.075(Qraa + Qwina) (11) 
Tu — Tc = 1.54988 0. (12) 


The relationship given by eq. (12) for the coupling conductance has 
been divided into two series elements in Fig. 5 each having a coefficient 
of 0.77. 

The value of A in eq. (10) was determined experimentally. The 
measured temperature rise at the center of the board in the middle of 
the top row relative to the air directly above (about one inch below 
the cabinet top) gives an effective heat transfer coefficient (h) of about 
0.3 watt/ft’—°F. Substitution of this value of h and S = 1 ft? into eq. 
(12) gives for a board in the top row 


Tz — Tr= 3.3q. (13) 


Solutions to the thermal model of Fig. 5 for Qp = 150 watts as a 
function of solar input power are plotted in Fig. 6. To show the 
sensitivity to wind speed, solutions for both 5 and 10 ft/s winds are 
presented. Shown on the figure are calculated curves for the cool wall, 
the hot wall and the top row of boards (points C, H, and B, respectively 
of Fig. 5). Recognize that the model predicts only a single temperature 
at each of these locations, whereas in actuality, temperature distribu- 
tions exist in each case. Experiments were then performed to verify 
the analytical results at points where the model applies. 


ll. THERMAL MODEL PREDICTIONS AND COMPARISONS WITH TEST 
DATA 


For purposes of comparing the analytic results with experimental 
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Fig. 5—-Lumped conductance model for a powered cabinet in the sun for specific 
conditions of analysis and tests. 
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Fig. 6—Measured cabinet temperatures compared with model predictions for Q, = 
150 watts and uniform breeze on all surfaces. 


data obtained by the author, two test dates have been selected because 
of their exceptionally clear skies, August 7 and September 12, 1979. 
The absence of clouds on these dates simplifies the task of correlating 
solar intensity with measured cabinet temperatures. Transient effects 
are small since the cabinet’s time constant (measured in the laboratory 
to be about 0.75 hours) is small compared with the time scale of the 
solar input waveform. Assuming the solar input to be approximated 
by a half sine-pulse of 10-hours’ duration, the peak daily temperature 
in the cabinet would be 98 percent of an ideal cabinet having no 
thermal lag. 
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Fig. 7—Average absorbed solar flux calculated from pyranometer traces for two test 
dates. 


Figures 13 and 14 in Appendix A show the incident solar radiation 
on a horizontal plane for these two dates using a Weatherman R413 
star pyranometer. The solar radiation absorbed by the test cabinet for 
each test date can be calculated from this data. Also needed is a 
specification of the absorptivity and area of each cabinet surface, as 
well as the orientation of each surface relative to the direct rays of the 
sun. This calculation is shown in Appendix A and the results given in 
Fig. 7. 

As shown in Fig. 7, the peak solar power absorbed by the cabinet on 
September 12 was about 35 percent greater than that on August 7. 
About half of this large difference was due to the sun’s lower position 
in the sky in September, which resulted in a larger component of direct 
solar radiation on the cabinet’s large back surface. The remaining half 
was due to a greater solar intensity on September 12, attributable to 
reduced atmospheric attenuation compared with August 7. As shown 
in Appendix A, the September 12 intensity equaled the values pub- 
lished by ASHRAE” for this date, while the August 7 intensity fell 
short of ASHRAE values. 

Figures 8 and 9 show measured cabinet temperatures for the same 
two dates. The cabinet top and back surface temperatures, which 
constitute the sunny hot walls, are plotted versus time of day. Observe 
that the back surface was hotter than the top surface on September 
12, but that the reverse was true on August 7, indicative of the sun’s 
lower altitude in September. The cabinet internal temperature was 
measured at the center of the board located at the middle of the top 
row and is believed to be fairly representative of the cabinet’s maxi- 
mum temperature (exclusive of hot spot effects near the resistor heat 
sources). It is close to the temperature that would occur if the cabinet’s 
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Fig. 8—Cabinet temperatures on August 7, 1979, 150-watt dissipation. 


internal dissipation were truly uniform. On September 12 (from Figs. 
7 and 9), the cabinet’s peak internal temperature rise above ambient 
was about 62°F; the absorbed solar radiation was 36.5 watts/ft”. On 
August 7 (Figs. 7 and 8), the cabinet’s peak internal temperature rise 
was about 51°F; the absorbed solar radiation was 27 watts/ft”. These 
results, along with measured surface temperatures, are plotted along- 
side the analytic results in Fig. 6. Agreement is within 15 percent of 
the 10 ft/s (6.8 mph) wind curve. The average wind speed during the 
tests is estimated to be in the range of 5 to 10 mph. 

Also given in Fig. 6 is the cool-shaded wall for August 7, shown as a 
range of temperatures (no measurement made on September 12). It is 
typical for the shaded wall to have such a vertical temperature gradient 
since it is the principal heat transfer surface for internal dissipation. 
By contrast, the sunny hot walls have a fairly uniform temperature. 
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Fig. 9—Cabinet temperatures on September 12, 1979, 150-watt dissipation. 


Evidence of this is provided by Fig. 10 which shows vertical tempera- 
ture profiles measured on the sunny (cabinet back) surface, the shaded 
(cabinet front) surface and the cabinet centerline (center of boards in 
the middle of rows). Observe the large vertical temperature gradients 
on the cabinet center line (1°F/in.) and the shaded surface (0.5°F/in.). 
By comparison, the sunny surface gradient of 0.1°F/in. is small. 

For the case of no solar radiation, Fig. 6 shows experimental tem- 
peratures measured in the lab at the same cabinet locations as before. 
Agreement is almost exact with respect to the 5 ft/s wind curve. The 
same measurements taken outdoors in the evening or early morning 
(not shown in the figure) were slightly less because of the action of 
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Fig. 10—Vertical temperature profiles on 2 p.m. daylight saving time, August 7, 1979, 
150-watt dissipation. 


light outdoor breezes. For instance, the board temperature averaged 2 
to 3°F cooler, giving closer agreement with the 10 ft/s wind curve. 
More generally, for the case of no solar radiation, the model’s 
prediction can be expressed algebraically. From inspection of Fig. 5, 
the board temperature (7's) for Qs = 0 and u = 10 ft/s is seen to be 


Tp — Ta = 0.77(Qp/2)°* + 0.075(Qp/2) + 3.3q, (14) 


where gq = @p/96. For a 5-ft/s breeze, the 0.075 coefficient becomes 
0.106. A comparison of eq. (14) with laboratory experimental temper- 
atures is shown in Fig. 11. The agreement is within 6 percent for both 
wind speeds over a 250 watt range of Qp. The calculated curve for wind 
speed of 5 ft/s is close to lab conditions of natural convection. 

Also plotted on Fig. 11 is the measured air temperature (correspond- 
ing to 7; in the model) at a point on the cabinet center line about one 
inch below the cabinet top. The measured temperature rise at the 
board (7g) relative to this (77) gives the effective heat transfer coeffi- 
cient of 0.3 watt/ft? — °F, which was used to obtain eq. (13). 
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Fig. 11—Comparison of analysis with indoor experimental results. 


Returning to Fig. 6, one sees that the measured board temperature 
agrees fairly well with the analysis for assumed breezes of 5 to 10 ft/s 
on all cabinet surfaces. Also, the mean hot wall temperature for 
September 12 (weighted mean of top and back wall) agrees. However, 
the mean hot wall temperature for August 7 is less than the model 
prediction. It is believed that the reason for this disagreement is 
different magnitude breezes on different cabinet surfaces. On August 
7, the prevailing breeze direction at the test site was from the south, 
tending to produce more convective heat transfer at the sunny surfaces 
than at the shaded surfaces. Figure 12 shows the results of recalculating 
the cabinet temperatures, increasing the assumed hot wall breeze to 
15 ft/s (10.2 mph) and decreasing the assumed shaded wall breeze to 
5 ft/s (3.4 mph). The agreement with August 7 data is improved, 
showing that a reasonable adjustment to the assumed average wind 
speed at each wall brings the analytic and experimental wall temper- 
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Fig. 12—Measured cabinet temperatures compared with model predictions for Q, = 
150 watts and different wind conditions on opposite cabinet walls. 


ature into near perfect agreement. The calculated board temperature 
is only slightly effected since it depends primarily on average wall 
conditions. 


IV. SUMMARY 


An approximate lumped thermal conductance model is presented 
for calculating maximum steady-state board temperatures in unventi- 
lated electronic cabinets, subject to both solar and uniformly-distrib- 
uted, internally-dissipated heat. The model has general applicability, 
with input parameters consisting of cabinet geometry, solar radiation, 
wind speed, and internal heat dissipation. 

The main feature of the model is the use of a convective coupling 
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conductance based on an empirical relationship found in the recent 
literature for the convective heat transfer between the hot and cold 
wall of an unpowered enclosure, and the attachment of a heat source 
(representing internally dissipated heat) to the center tap of this 
coupling conductance. 

Several phenomena are ignored in the analysis. The sensitivity to 
these are shown in Appendix B to be in the order of 10 percent. 
Consistent with this, the agreement between the analysis and experi- 
mental results is within 10 percent. 

The results of this work provides a useful design tool for determining 
the minimum cabinet size needed to safely dissipate the total heat of 
a system. The temperature rises of individual devices and circuit 
boards, which can be determined from individual laboratory tests, can 
be added to the predicted maximum cabinet ambient (7; of the model) 
to give the maximum device or circuit board temperature operating in 
the full-system cabinet ambient. Also, the analysis tells the designer 
how much temperature rise to allow for solar radiation and how much 
cooling can be expected from breezes. 

Although useful for predicting the maximum cabinet ambient at the 
top row, the model does not predict the cooler temperatures at lower 
rows. This is not a serious limitation for two reasons. First, the thermal 
design is limited by the maximum temperature which normally occurs 
at the top. Second, one can determine the vertical temperature profile 
up between the boards by scaling the results from central office frame 
analyses. The board temperatures in the cabinet are hotter than in the 
central office frame because of the reduced convective air flow in the 
cabinet, but the profiles are similar with respect to height. In both 
cases, one can assume ambient temperature below the bottom shelf of 
boards. 

A more severe limitation is the model’s inability to account for 
nonuniformly distributed heat sources. A similar difficulty exists in 
analyzing central office frames where it is common practice to compute 
temperatures based on average heat dissipation. The cabinet model, 
by considering just total dissipation, is doing a similar thing. For 
modest departures from uniformity (e.g., alternate boards powered) 
the model’s effectiveness is unimpaired. Also, certain extremely non- 
uniform dissipations can be handled. For instance, if all the heat 
dissipation is at one shelf level, good experimental agreement is ob- 
tained if an effective area is substituted into the model which excludes 
the wall below that shelf level. 

These and other topics are under study to refine the approximate 
model presented here and ultimately to be incorporated into general 
engineering design guidelines for reliable packaging of loop electronics 
in outdoor environments. 
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APPENDIX A 
Computation of Absorbed Solar Power 


Figures 13 and 14 give the vertical component of solar intensity on 
August 7, 1979, and September 12, 1979, measured by P. E. Fiechter 
using a Weatherman Star pyranometer. Also shown on each figure are 
calculated curves of this vertical component using data and relation- 
ships given by ASHRAE.”° Observe that the computed curve of total 
radiation (direct + diffuse) agrees very well with the September 12 
pyranometer trace. By comparison, the pyranometer trace for August 
7 falls short of the computed curve for this date. The discrepancy on 
August 7 is attributed to a light haze on this date. 

The total solar power absorbed by the cabinet can be computed 
from Figs. 138 and 14 and a specification of the orientation of each 
cabinet wall with respect to the direct rays of sunlight. For this 
purpose, Fig. 15 shows traces of the sun’s altitude angle (L) and 
azimuth angle (Z) (with respect to south) taken from ASHRAE.” The 
solar power absorbed by each vertical wall is given by 


Qv = aAwl cos(L)cos(Z — N) 
and that absorbed by the top (horizontal) surface 
Qu = aAwl sin(L), 
where a is the absorptivity (0.78), N is the angle made by the wall’s 
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Fig. 13—Pyranometer trace compared with solar intensity computed from ASHRAE 
data for a horizontal surface on August 7, 1979. 


THERMAL DESIGN FOR CABINETS 243 


Ee joe 
[e) 
2 an 
a z= 
= 
< = 
3 ae 
a aA 
cc Oa 
uJ = 
a MEASURED aS 
F ——— COMPUTED Sg 
< gm 





6 5 4 3 2 1 12 11 10 9 8 
TIME IN DAYLIGHT SAVING TIME 


Fig. 14—Pyranometer trace compared with solar intensity computed from ASHRAE 
data for a horizontal surface on September 12, 1979. 


normal with south, A is the wall area (ft”), and I is the intensity of the 
direct radiation (watts/ft”) given by dividing the pyranometer trace by 
sin(L). The cabinet surfaces exposed to sun were: top — 3.3 ft”, side 
— 3.3 ft?, back — 11.1 ft”. The back surface faced 30° west of south (N 
= 30°). 

The total absorbed solar power for each date, shown in Fig. 7 of the 
text, is obtained by adding the contribution of each wall. 


APPENDIX B 
The Effect of Factors Ignored in the Analysis 


Several phenomena which exist in a real electronic cabinet have 
been ignored in the analysis. This appendix discusses five of these 
phenomena and estimates their effect on temperature. 

In using Seki’s results [eq. (1)], the restriction to air flow caused by 
equipment in the cabinet is ignored. Most of the convected heat is 
carried in a thin (typically less than one inch) boundary layer at the 
cabinet walls. The air drag of the equipment, which is mostly in the 
core of the cabinet, is small provided adequate clearance exists at the 
walls. The effect on temperatures of this air drag is estimated to be 
the same order of magnitude as that caused by reducing Seki’s enclo- 
sure width (W) to a value equal to the sum of the clearances at 
opposing cabinet walls, say two inches. From eq. (2), one can see that 
the increase in temperature for an enclosure whose original dimensions 
are H = 40 in. and W = 10 in., which is reduced in width to W = 2 in., 
is about 15 percent. Thus, the effect of equipment in the cabinet is 
estimated to be of the same order of magnitude. 

Note that eq. (1) is applicable only within the boundary layer flow- 
regime which, according to Eckert and Carlson,°* occurs for height- 
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Fig. 15—Solar altitude and azimuth (from south) for test dates. 


width ratio less than 2.32 x 10°* Grw, where Grw is the Grashof 
number based on enclosure width. The limiting clearance per wall 
based on this constraint, for Ty — Tc = 25°F and H = 40 in., is 0.9 in. 
Thus, eq. (2) should be valid for the 1-inch clearance. 

A second difference between Seki’s enclosure and the real cabinet is 
the fact that the cabinet walls are not forced to be isothermal. The 
vertical temperature gradient that naturally occurs up the cabinet 
walls alters the overall heat transfer from that measured by Seki and 
others. The effect of this on the internal temperatures is difficult to 
estimate, but a rough idea can be obtained by comparing the maximum 
temperature of a constant heat flux wall with that of an isothermal 
wall, both standing vertically in a uniform ambient. For either wall, 
AT = NH°’Q°*, where N = 8.7 for constant heat flux and N = 7.2 for 
constant wall temperature. The maximum temperature of the constant 
heat flux wall is 19 percent greater. Assuming the cabinet walls to lie 
somewhere between these extremes, then the analysis underestimates 
the internal cabinet temperatures by about 10 percent. 

Conductive and radiative heat transfer have been ignored in the 
analysis. Conduction from the circuit boards to the cabinet walls is 
negligible. However, conduction from the hot sunny wall to the cool 
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shaded wall through the cabinet metal is estimated to be about 0.1 
watt/°F for the test cabinet. This conductance amounts to about 10 
percent of the convective heat flow given by eq. (2). The average wall 
temperature and internal cabinet temperature are affected to a much 
lesser extent. 

The heat transfer by radiation from the heat dissipating boards to 
the cabinet walls depends largely on geometry. For the test conditions 
consisting of 8-inch deep boards on 1-inch centers, the view factor at 
the board center is 0.03. Because of this small view factor and because 
the emissivity of the boards approaches zero in the plane of the board, 
the heat transfer by radiation is small, calculated to amount to about 
10 percent of the boards’ dissipation. Consequently, board tempera- 
tures are about 10 percent cooler than that predicted by the model. 

A fifth factor is the cabinet roof which, in the real cabinet, is a 
principal heat transfer surface but, in the ideal enclosure, is adiabatic. 
In the analyses, the roof area was lumped in with that of the vertical 
walls, thereby assigning to the roof the same average heat transfer as 
that of the walls. Some error results from the approximation. For 
instance, in the no-solar case, the heat transfer per unit area through 
the roof is approximately twice that of the walls (on average), since its 
temperature equals that at the top (maximum) of walls. Thus, for the 
test cabinet whose roof comprises 10 percent of the total cabinet 
surface, the analysis underestimates the total heat transfer and over- 
estimates the cabinet temperature by about 10 percent. 

Five factors (more could be added), which are ignored in the analysis, 
are discussed in this Appendix. Some tend to increase temperature and 
some decrease temperature. All, broadly speaking, are estimated to 
have effects in the range of 10 percent. This is consistent with the 
agreement found between analysis and experiment. 
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Fail-Safe Nodes for Lightguide Digital 
Networks 


By A. ALBANESE 
(Manuscript received October 6, 1981) 


Lightguide digital networks that use fail-safe nodes made of an 
optical regenerator and optical couplers are described and analyzed. 
Every node in the network can regenerate or overwrite information 
traveling in a ring or bus network, and in the case of a power failure 
at one of the nodes, the network continues to function because the 
coupler keeps the continuity at the failing node. Fail-safe nodes that 
operate at 16 Mb/s were built to implement a digital network of ring 
architecture. A description of the components is presented, together 
with an analysis of the design constraints of the different parts of the 
fail-safe nodes. 


|. INTRODUCTION 


The use of regenerators at the nodes of a lightwave network in- 
troduces a reliability problem when the power at one node fails. Optical 
passive couplers solve this problem, but the number of passive couplers 
in a network is limited by the maximum insertion loss that can 
be tolerated between a transmitter and the receiver farthest away 
from it.’ 

This paper describes a new arrangement for a lightguide digital 
network built with fail-safe nodes and with the characteristics that the 
number of stations is independent of the coupler insertion loss, and 
that the network keeps functioning when the power at one or more 
nodes fails. A fail-safe node consists of a lightguide receiver and a 
lightguide transmitter electrically connected by a regenerator and 
optically connected by a directional coupler. Figure 1 shows a config- 
uration for a fail-safe node consisting of a lightwave receiver and 
transmitter pair connected by a regenerator and a directional coupler 
that provides optical continuity when the power at the node fails. The 
feasibility of the network was tested using lightwave transmitters 
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Fig. 2—Ring-type lightguide network. 


(GaAlAs LED, A = 0.8 pm) and lightwave, avalanche photodiode (APD) 
receivers made by Western Electric. The repeaters were built using 
transistor-transistor logic (TTL) integrated circuits, and the network 
was operated with 16 Mb/s digital signals. 

A node can regenerate, overwrite, or be off, depending on whether 
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Fig. 3—A two-coupler node. 


the regenerator is operating. In the regenerating configuration, the 
transmitter is controlled by the receiver, while in the overwriting state, 
the transmitter is independent of the receiver. 


li. FAIL-SAFE NETWORKS 


Fail-safe nodes can be connected together by a lightguide to form a 
ring-type network as shown in Fig. 2. Each node consists of a coupler, 
C, a receiver, R, a regenerator, RG, and a transmitter, T. The nodes 
are connected by lightguides, L. The E input disables the regenerator. 
The ring architecture was selected as an example; fail-safe nodes may 
also be used in other optical bus-type networks.’ The nodes in the 
network are normally regenerating; that is, each node listens and 
regenerates the information flowing in the network. When a node 
wants to transmit, it turns its regenerator off and the information is 
inserted in the network by the lightguide transmitter. If the power at 
one node fails, or if the electronic components are removed for main- 
tenance, the node is in the off state, and the optical coupler provides 
the continuity needed for the operation of the network. An additional 
advantage is that all the signals from different stations arrive with the 
same intensity at every receiver. 

For proper operation of the fail-safe network, every node in the 
network must meet the three following constraints: 

(t) Sensitivity constraint—The receiver of any node must be 
sensitive enough to receive the signal from a preceding transmitter 
when several nodes between the transmitter and the receiver are off. 

(ti) Interference constraint—When two nodes are transmitting 
simultaneously, a node down the line should receive the signal from 
the closer node. This discrimination between the two transmitters is 
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Fig. 4—Feasibility graph for one-coupler nodes. 


achieved by the automatic gain control (Acc) of the receiver that 
adjusts the gain so the comparator circuit can detect only the stronger 
signal, while the less intense signal is below the threshold level. 

(tit) Automatic gain control response time—lIn the case of a ring- 
type network, the sending node should be prevented from regenerating 
its own pulses to avoid having pulses traveling around the ring forever. 
This constraint is satisfied when the response time of the acc in the 
lightguide receiver is longer than the time it takes a pulse to go around 
the ring once. 


lll. ANALYSIS 


Each fail-safe node in the network may have one or two couplers, 
depending on whether the network uses return-to-zero or nonreturn- 
to-zero formats. Figure 1 shows the case where one coupler is used. 
The receiver has to be off during the time the transmitter is on to 
avoid saturation. This is achieved using a signal with a duty ratio less 
than 50 percent, and having the transmitter operating out of phase 
from the receiver. Figure 3 shows the coupler arrangement when two 
couplers are used.” In this case, the receiver can always be on because 
it does not receive light from the transmitter. 
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Fig. 5—Maximum lightguide loss for one-coupler nodes. 


The coupler used in both cases can be characterized by a four-port 
device with a transmission coefficient a, a coupling coefficient 8, and 
an excess loss coefficient y = a + 8. Couplers with y better than —1 dB 
have been reported in the literature.* 

In the case of Fig. 1, Pin is the light entering the coupler from the 
ring; P; is the light entering the coupler from the transmitter; aPin + 
BP; is the amount of light entering the receiver, P,; aP; + BPin is the 
amount of light leaving the coupler and going into the ring, Pou. And 
in the case of Fig. 3, P, = aiPin, and Pou = a2P; + Bif2Pin. These two 
node configurations will be analyzed next. 


3.1 One-coupler nodes 


Let us consider first the case of one-coupler node and analyze a 
hypothetical network where @ adjacent nodes have failed and they 
are off. The power received after Q failed nodes is P, + Pi; Ps is the 
power received from the closest active transmitter: 


P, = P,L®'B%o’, (1) 
and P; the sum of all the powers received from all the other previous 


active transmitters that may cause interference: 
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Fig. 6—Optimum coupling coefficient for one-coupler nodes. 
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In eqs. (1) and (2), LZ is the average lightguide attenuation between 
two adjacent nodes, and the infinite summation accounts for the worst 
case of interference. 

The sensitivity and the interference constraints are satisfied when 
the effective received power, P,e(Q), is larger than the sensitivity of 
the receiver, S, 


(2) 


P,-(Q) = Ps; — P;= S; (3) 


the minus sign accounts for the reduction in the opening of the eye 
diagram caused by the interference. 
Equations 1 and 2, and a = y — £ are used to rewrite eq. 3 as 
1 — 2BL S 
L®+1g%y — By? ——| >=), 4 

B(y oe P, (4) 
which limits the values of 8 and L that satisfy the network constraints. 
In eq. 4, M is the optical power margin between the transmitter and 
the receiver. 
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Fig. 7—Feasibility graph for two-coupler nodes. 


The interference is deterministic, and it will not be seen by the 
comparator in the receiver because the Acc circuit sets the threshold 
automatically to one half of the peak amplitude which is precisely in 
the middle of the eye pattern. In addition to satisfying the sensitivity 
constraint expressed by eq. 3, one should have a signal-to-interference 
ratio (SIR) greater than 2 to eliminate any possible error caused by 
variations in the pulse amplitude, 





> 2. (5) 


The sir value of 2 was selected experimentally as the value where the 
error rate doubles. 

Equations 4 and 5 are used to find the minimum value of L for a 
given B. This is done by defining a variable V = BL that allows us to 
express L and £ as a function of V: 


Rae F(V) + [F°(V) + 2VF(V)/y]!” 


<i~x 


B mm I (6) 
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Fig. 8—Maximum lightguide loss for two-coupler nodes. 


where 
1-V 
2My7(1 — 2V)V®" 


Equation 5 restricts the possible values of V to be within the range 
from 0 to 0.33. 

Figure 4 shows a relation between L and £ as given by eq. 6 for 
values of Q@ = 0, 1, 2, and 3, y = —1 dB, and M = 30 dB. Values L and 
B are generally expressed in decibels, and L is commonly called the 
lightguide loss. 

We can define an optimum coupling, £., as the value of 8 that allows 
the maximum lightguide loss, L,,, for a given power margin M. Figures 
5 and 6 show the values of maximum L,,, and 8, as a function of M, 
and for a y = —1 dB. 

Figures 4 and 5 also show that the use of the fail-safe nodes in the 
network reduces the maximum lightguide loss between repeaters. This 
fact cannot be tolerated in transmission systems, but it may be possible 
in local area networks where the lightguide loss may not be a limiting 
factor. 

Figure 4 shows that the maximum lightguide loss L,, is not sensitive 


F(V) = 
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Fig. 9—Optimum coupler coefficient for two-coupling nodes. 


to variations of 8 within +1 dB from the optimum value of £, B.. From 
Fig. 6, one could establish that a coupler with 6 between 4 and 6 dB is 
adequate for different values of M and Q. 


3.2 Two-coupler nodes 


The analysis of a network with two-coupler nodes is similar to the 
case of one-coupler nodes substituting B by f” in eqs. 1, 2, and 5. Note 
that the expression of a = y — 8 remains the same for both cases. In 
this case, 8 and L are related by the expression 








1-—2B°L|_S 
L 28294 — py? i 
B'*(y — B) [— BL =p, (7) 
for the sensitivity constraint, and 
1—- BL 
SIR = a =2 (8) 


for the interference constraint. Figure 7 shows a relation between L 
and £ as given by eqs. (7) and (8) for values of Q = 0, 1, and 2, y = —1 
dB, and M = 30 dB. Figures 8 and 9 show the values of L,, and £, as 
a function of M when y = —1 dB. 
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A comparison of Figs. 7, 8, and 9 against Figs. 4, 5, and 6 indicates 
that there is a 3-dB penalty when using nodes made of two couplers 
instead of one. 


iV. CONCLUSIONS 


Two configurations of optical couplers were analyzed to provide 
continuity in an optical network in the case of a power failure at 
several consecutive nodes. The analysis determines the optimum cou- 
pling coefficient, and the maximum lightguide loss that a network can 
have. Optical networks with fail-safe nodes are of interest in local area 
networks where the lightguide transmission loss is substantially less 
than the optical power margin between the transmitter and the re- 
ceiver. 
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Fabrication and Properties of Single-Mode 
Optical Fiber Exhibiting Low Dispersion, Low 
Loss, and Tight Mode Confinement 
Simultaneously 


By A. D. PEARSON, P. D. LAZAY, and W. A. REED 
(Manuscript received October 5, 1981) 


Single-mode fiber with a new index profile design has been fabri- 
cated. The design allows the decoupling of bandwidth related factors 
from considerations which affect curvature-induced losses. 


I. INTRODUCTION 


In a previous publication, Lazay et al.’ reported a single-mode fiber 
design with a new refractive index profile that allows the wavelength 
of zero total dispersion to be positioned in the vicinity of 1.31 pm, while 
simultaneously providing low loss and tight mode confinement. This 
report will describe the fabrication and properties of fibers of this new 
design. 

The design uses a heavily fluorine-doped phosphosilicate cladding 
and a germania-doped silica core. The core and cladding compositions 
were chosen to provide a reduced material dispersion that cancels the 
waveguide dispersion near 1.3 ym. At the same time, the total core-to- 
clad index difference was chosen to give a delta of 0.5 percent, sufficient 
to produce a small modal size and excellent resistance to curvature- 
induced losses. We anticipate that the losses ultimately achievable 
with this new design will be lower than those of step-index GeOQ2-SiO2 
core fibers of the same core diameter, largely because of the reduced 
contribution from Ge scattering. Ainslie et al.? have pointed out the 
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loss advantage to be gained by the use of a low refractive index 
cladding and moderately doped core. 

Previous single-mode fiber designs which we have studied were 
based on either a low A (~0.20 percent) and large core diameter (~10 
pm) or a high A (~0.5 percent) and a small core diameter (~7.5 ym). 
The first class of fibers is capable of having zero dispersion wavelengths 
near 1.3 pm but has exhibited sensitivity to cabling induced loss 
because of mode confinement problems. The second class of fiber has 
exhibited no cabling induced loss but has a zero dispersion wavelength 
near 1.35 ym and, consequently, the dispersion near 1.3 ym is too large 
for high data rate (274 mb/s and above) undersea systems having 
repeater spacings of 30 to 50 km.*“ 


ll. DESIGN 


The basic goal is to make a fiber with a lightly GeO. doped SiOz 
core such that the material dispersion contribution will cancel the 
waveguide dispersion at or very near to the system operating wave- 
length, which is expected to be around 1.3 ym.‘ At the same time, it is 
imperative to retain the relatively small core (e.g., 7.5 to 8 wm) and 
large A in order to have low bending-induced loss. This is accomplished 
by providing a phosphosilicate cladding heavily doped with fluorine. 
The negative A~ of the cladding, combined with the positive A* of the 
core, both relative to silica, gives the required total A value of about 
0.5 percent. With a core diameter of 7.5 to 8 xm, this design provides 
excellent resistance to bending-induced loss, and cutoff wavelengths of 
around 1.2 ym. Most importantly, since the core and cladding doping 
levels can be varied independently, the core composition can be 
adjusted to make the wavelength of zero total dispersion fall at or very 
close to the projected system operating wavelength. 

Phosphorous doping of the cladding was necessary to provide rea- 
sonable processing temperatures. Although the P.O; raises the refrac- 
tive index slightly, the F doping depresses it much more, without 
substantially affecting the processing temperature. The reduction in 
refractive index of silicate glasses when fluorine replaces oxygen was 
discovered by Schott and Abbe in the late nineteenth century,” and 
the use of fluorine doping to reduce the refractive index of silicate 
compositions in optical fibers has been reported previously.®”*? 


lll. FIBER FABRICATION 


The preform was made by Modified Chemical Vapor Deposition 
(MCVD) in a 19- by 25-mm Heraeus T08-W6G silica tube. The delivery 
rates of the reactants for the cladding (SiCl,, POCls, and CF2Cl.) were 
chosen to give a glass deposition rate of about 0.35 g/min and a A™ of 
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—0.185 percent. The cladding was deposited in 16 passes without 
pressurization. The core was deposited in two passes using SiC, and 
GeCl, with a A* of +0.315 percent. A compensated collapse procedure 
was used, but no other chemical drying agents were used. After 
drawing, the fiber dimensions were OD 114 ym, core diameter 7.5 pm, 
deposited cladding diameter 44 um, and length 1 km. 


IV. CHARACTERIZATION OF FIBER 


Cutoff was determined, using the method described by Lazay,”° as 
the location of the rapid drop in power transmitted through a 3-meter 
length of fiber as the wavelength of the incident light was increased. A 
well-defined cutoff was located at A, = 1.192 + .005 pm. 

The loss spectrum was measured from 1.0 to 1.7 um using the far- 
end/near-end technique with a 3-meter near-end length. The loss was 
measured with and without a single 40-mm radius loop in the near-end 
length. Figure 1 shows the loss curve measured with the loop. Quite 
surprisingly, the loss curve without the loop was essentially identical, 
even in the vicinity of cutoff at 1.19 um. It has been our experience 
that this is a signature of very good mode confinement. The loss has 
a local minimum at 1.30 pm of 0.57 + .03 dB/km, and a minimum loss 
of 0.40 dB/km at 1.50 um. Beyond 1.5 ym, the loss rises rapidly and all 
evidence indicates that the loss is unbounded. This loss “edge” occurs 
when the effective mode index (the propogation constant divided by 
27/A) falls below the index of the substrate tube. When this happens, 
the mode becomes cut off because the power can leak through the 
cladding and be lost by the process of radiation." The wavelength at 
which the loss rises can be moved to longer wavelengths by increasing 
the thickness of the deposited cladding. In any case, the loss at 1.3 um 
is not adversely affected. 

The total chromatic dispersion in the single-mode regime was cal- 
culated from the derivative of group delay versus wavelength data.” 
These data were obtained using narrow pulses generated by stimulated 
Raman scattering in a single-mode fiber pumped with 1.06-um pulses 
from a mode-locked Q-switched Nd:YAG laser. The wavelength of the 
pulses emerging from the Raman fiber was selected with a grating 
monochromator. Figure 2 shows the spectral dependence of the dis- 
persion. The zero dispersion wavelength, Xo, is located at 1.312 um. 


V. CONCLUSIONS 


We have made a fiber that implements a new design. This design 
allows dispersion optimization through the manipulation of core and 
cladding glass compositions, while providing low curvature-induced 
losses through proper choice of core diameter and A. The ability to 
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Fig. 1—Loss versus wavelength for depressed index single-mode fiber. The error bars 
represent 3o. 
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Fig. 2—Total dispersion versus wavelength for depressed index single-mode fiber. 
The inset shows the idealized refractive index profile. 
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largely decouple bandwidth design from loss related requirements is a 
new and unique feature of this design. 

It is anticipated that further reduction in loss will be possible. A 
more complete study of the curvature-induced loss sensitivity of this 
fiber is underway. 
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